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THE SEVERE ACUTE RESPIRATORY SYNDROME CORONAVIRUS 

All documents cited herein are incorporated by reference in their entirety. 
RELATED APPLICATIONS, FROM WHICH PRIORITY IS CLAIMED 

This application incorporates by reference in its entirety US provisional patent application 
60/462,218, Attorney Reference No. PP20474.001, filed on April 10, 2003 via Express Mail with 
the US post office, US provisional patent application 60/462,465, Attorney Reference No. 
PP20480.001, filed on April 11, 2003 via Express Mail with the US post office, US provisional 
patent application 60/462,418, Attorney Reference No. PP20480.002, filed on April 12, 2003 via 
Express Mail with the US post office, US provisional patent application 60/462,748, Attorney 
Reference No. PP20480.003, filed on April 13, 2003 via Express Mail with the US post office, 
US provisional patent application 60/463,109, Attorney Reference No. PP20480.004, filed on 
April 14, 2003 via Express Mail with the US post office, US provisional patent application 
60/463,460, Attorney Reference No. PP20480.005, filed on April 15, 2003 via Express Mail with 
the US post office, US provisional patent application 60/463,668, Attorney Reference No. 
PP20480.006, filed on April 16, 2003 via Express Mail with the US post office, US provisional 
patent application 60/463,983, Attorney Reference No, PP20480.007, filed on April 17, 2003 via 
Express Mail with the US post office, US provisional patent application 60/463,971, Attorney 
Reference No. PP20480.008, filed on April 18, 2003 via Express Mail with the US post office, 
US provisional patent application 60/464,899, Attorney Reference No. PP20480.009, filed on 
April 22, 2003 via Express Mail with the US post office, US provisional patent application 
60/464,838, Attorney Reference No. PP20507.001, filed on April 22, 2003 via Express Mail with 
the US post office, US provisional patent application 60/465,273, Attorney Reference No. 
PP20518.001, filed on April 23, 2003 via Express Mail with the US post office, US provisional 
patent application 60/465,535, Attorney Reference No. PP20518.002, filed on April 24, 2003 via 
Express Mail with the US post office, US provisional patent application 60/468,312, Attorney 
Reference No. PP20480.010, filed on May 5, 2003 via Express Mail with the US post office, and 
US provisional patent application 60/473,144, Attorney Reference No. PP20480.011, filed on 
May 22, 2003, US provisional patent application 60/495,024, Attorney Reference No. 
PP20480.012, filed on August 14, 2003 via Express Mail with the US post office, US provisional 
patent application 60/505,652, Attorney Reference No. PP20480.013, filed on September 23, 
2003 via Express Mail with the US post office, US provisional patent application 60/510,781, 
Attorney Reference No. PP20480.014, filed on October 11, 2003 via Express Mail with the US 
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post office. US provisional patent application 60/529,464, Attorney Reference No. PP20480.015, 
filed on December 1 1 , 2003 via Express Mail with the US post office, US provisional patent 
application 60/536,177, Attorney Reference No. PP20480.016, filed on January 12, 2004 via 

Express Mail with the US post office, and US provisional patent application 60/ , , 

5 Attorney Reference No. PP20480.017, filed on April 7, 2004 via Express Mail with the US post 
office. 

FIELD OF THE INVENTION _ . 

The invention relates to nucleic acids and proteins from Severe Acute Respiratory 
Syndrome (S ARS) Virus. These nucleic acids and proteins can be used in the preparation and 

10 manufacture of vaccine formulations for the treatment or prevention of SARS. The invention 
also relates to diagnostic reagents, kits (comprising such reagents) and methods which can be 
used to diagnose or identify the presence or absence of a SARS virus in a biological sample. The 
invention also relates to methods for the treatment or prevention of SARS utilizing small 
molecule viral inhibitors and combinations of small molecule viral inhibitors and kits for the 

15 tireamentof SARS. 

BACKGROUND OF THE INVENTION 

An outbreak of a virulent respiratory virus, now known as Severe Acute Respiratory 
Syndrome (SARS), was identified in Hong Kong, China and a number of other countries around 
the world in 2003. Patients typically had symptoms including fever, dry cough, dyspnea, 
20 headache, and hypoxemia. Isolates of the SARS virus appear to have homology with at least the 
RNA polymerase gene of several known coronaviruses. A phylogenetic analysis of this 
homology is presented in Peiris et al, "Coronavirus as a possible cause of severe acute 
respiratory syndrome", Lancet, published online April 8, 2003 at 

http://image.ihelancet.coiv/extras/03art3477web.pdf, incorporated herein by reference in its entirety. 
25 Other sequenced fragments of the SARS virus genome appear to overlap with the open reading 
frame lb of coronaviruses. See, Drosten et al., "Identification of a Novel Coronavirus in 
Patients with Severe Acute Respiratory Syndrome", New England Journal of Medicine, 
published online at http://www.nepn.or8 on April 10, 2003, incorporated herein by reference in its 
entirety. 

30 The Genome Science Center in British Colombia, Canada published on its website 

(http://www.bcgsc.ca/bioinfo/SARS/) a draft genome assembly of 29,736 base pairs of a virus believed 
to be a SARS virus, referred to as the TOR2 isolate. This draft genome assembly is given herein 
asSEQIDNO:!. 

The Centers for Disease Control (CDC) published a nucleotide sequence of a SARS-CoV 
35 . strain (SEQ ID NO: 2) on its website (}ittp:/Avww.cdc.gov/ncidod/sars/pdfAtucleoseq.pdf). The CDC 

-2- 
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has also published a phylogenetic tree of the predicted N, S and M proteins (attached as FIGURE 
6). This tree places the SARS virus outside any of the previously known coronavirus groups. 

There is a growing need for prophylactic or therapeutic vaccines against the SARS virus as 
well as diagnostic and screening methods and compositions to identify the presence of the virus 
5 in, e.g., mammalian tissue or serum. 

SUMMARY OF THE INVENTION 

The invention relates to nucleic acids and proteins from Severe Acute Respiratory- 
Syndrome (SARS) virus. These nucleic acids and proteins can be used in the preparation and 
manufacture of vaccine formulations for the treatment or prevention of SARS. Such vaccine 

10 formulations may include an inactivated (or killed) SARS virus, an attenuated SARS virus, a 
spUt SARS virus preparation and a recombinant or purified subunit formulation of one or more 
SARS viral antigens. Expression and delivery of the polynucleotides of the invention may be 
facilitated via viral vectors and/or viral particles. 

The invention also relates to diagnostic reagents, kits (comprising such reagents) and 

15 methods which can be used to diagnose or identify the presence or absence of a SARS virus in a 
biological sample. The invention further includes non-coding SARS viral polynucleotide 
sequences, SARS viral sequences encoding for non-immunogenic proteins, conserved and 
variant SARS viral polynucleotide sequences for use in such diagnostic compositions and 
methods. 

20 The invention further relates to vaccine formulations comprising one or more SARS virus 

antigens and one or more other respiratory virus antigens. Additional respiratory virus antigens 
suitable for use in the invention include antigens from influenza virus, human rhinovirus (HRV), 
parainfluenza virus (PIV), respiratory syncytial virus (RSV), adenovirus, metapneumovirus, and 
rhinovirus. The additional respiratory virus antigen could also be from a coronavirus other than 

25 the SARS coronavmis. Preferably, the additional respiratory virus antigen is an influenza viral 
antigen. 

The compositions of the invention may further comprise one or more adjuvants. Adjuvants 
suitable for use in the invention include mucosal, transdermal or parenteral adjuvants. Mucosal 
adjuvants suitable for use in the invention include detoxified bacterial ADP-ribosylating toxins, 
30 such as E. coli heat labile toxoids (e.g. , LTK63), chitosan and derivatives thereof, and non-toxic 
double mutant forms of Bordetella pertussis toxoids. Parenteral adjuvants suitable for use in the 
invention include MF59 and aluminum or aluminum salts. 

The invention also provides methods for treating SARS by administering smaU molecule 
compounds, as well as methods of identifying potent small molecules for the treatment of SARS. 
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In one aspect of the invention a method of identi^ng a therapeutically active agent is 
provided comprising: (a) contacting the therapeutically active agent with a cell infected with the 
SARS virus; (b) measuring attenuation of a SARS related enzyme. 

In a more particular embodiment, the therapeutically active agent is a small molecule. Jn 

5 another more particular embodiment, the therapeutically active agent is a nucleoside analog. In 
another inore particular embodiment the therapeutically active agent is a peptoid, oligopeptide, 
or polypeptide. In another embodiment the SARS related enzyme is SARS protease. In another 
embodiment the SARS related enzyme is SARS polymerase. In still another embodiment the 
SARS related enzyme is a kinase. Methods of identifying therapeutically active agents for 

10 treatment of SARS virus infection are further discussed in Section V below. 

In another aspect of the invention a method of treating a human infected with SARS is 
provided comprising administering a small molecule to a patient in need thereof. In one 
embodiment the small molecule is an inhibitor of SARS protease. In another embodiment the 
small molecule is an inhibitor of SARS polymerase. In another embodiment the SARS related 

15 enzyme is a kinase. In still another embodiment the small molecule is administered orally or 
parenterally. 

The invention also provides the use of such small molecules in the manufacture of a 
medicament for the treatment of severe acute respiratory syndrome. 

Small molecule compounds of the present invention include those of less than 1000 g/mol, 

20 preferably with an aromatic region and greater than one heteroatom selected from O, S, or N. 
Preferred small molecules include, but are not limited to acyclovir, gancyclovir, vidarabidine, 
foscamet, cidofovir, amantidine, ribavirin, trifluorothymidine, zidovudine, didanosine, 
zalcitabine, and combinations thereof. Interferons may also be used for treating patients, 
including interferon-a and interferon-p. Interferon treatment has shown promise in treating 

25 SARS in monkeys (Enserink (2004) Science 303: 1273-1275), particularly when pegylated 
(Haagmans et al. (2004) Nature Medicine 10:290-293). 

One aspect of the present invention relates to methods for identifying individuals exposed 
to, and biological samples containing SARS virus (SARSV), and to kits for carrying out the 
methods. Such methods can utilize nucleic acid detection techniques such as PGR, RT-PCR (the 

30 Coronaviridae are RNA viruses), transcription-mediated amplification (TMA), ligase chain 
reaction (LCR), branched DNA signal amplification assays, isothermal nucleic acid sequence 
based amplification (NASBA), other self-sustained sequence replication assays, boomerang 
DNA amplification, strand-displacement activation, cycling probe technology, or combinations 
of such amplification methods. Such nucleic acid detection techniques utilize oligonucleotides 

35 having nucleotide sequence similar to, or complementary to, the SARS viral genome, as primers 
ie.g.y for amplification) and as probes {e.g., for capture or detection), as is well known in the art. 
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Alternatively, or in addition to the nucleic acid detection methods described supra, the 
methods of the present invention can utilize various immunoassay techniques for detection of 
SARSV antigens and/or antibodies. 

Accordingly, the present invention relates to methods of identifying individuals exposed to 
5 SARSV, or biological samples containing SARSV, by detecting the presence of SARSV 
antigens using antibodies which specifically bind to the same. The antibodies are preferably 
monoclonal antibodies- Quantification of the amount of viral antigens present in a sample of an 
individual may be used in detennining the prognosis of an infected individual. Preferably, the 
SARSV antigens to be detected are generally one of the structural proteins, particularly those 

10 present on the surface of the viral particles and include, for example, the spike glycoprotein (S), 
also called E2; the envelope (small membrane) protein (E), also called sM; the membrane 
glycoprotein (M), also called El; the hemagglutinin-esterase glycoprotein (HE); also called E3; 
and the nucleocapsid phosphoprotein (N). In preferred embodiments, the antigens to be detected 
are the S, E and M proteins using antibodies to the same. 

15 The present invention relates to kits for identifying individual SARSV and reagents used in 

such kits. The kits comprise a first container which contains antibodies which specifically bind to 
a SARSV antigen and a second container which contains the SARSV antigen. The antibodies are 
preferably monoclonal antibodies. The kits may be adapted for quantifying the amount of 
antigen in a sample of an individual. Such information may be used in determining the prognosis 

20 . of an infected individual. 

The present invention relates to methods of identifying individuals exposed to SARS virus, 
or biological samples containing SARSV, by detecting the presence of antibodies against SARS 
vims antigen in a sample using SARS antigen. Quantification of the amount of anti-SARS 
protein from SARS antibodies present in a sample of an individual may be used in determining 

25 the prognosis of an infected individual. Any one or more of the viral proteins (structural proteins 
or nonstructural proteins) may be used as antigen to detect the SARSV antibodies; preferably a 
SARSV antigen that is conserved amoung SARSV isolates is preferred. In this regard, 
nonstructural protein (e.g., Pol, Hel, 3CLp, MP, PLPl, PLP2) may be particularly useful. 
The present invention relates to kits for identifying individuals exposed to SARS and 

30 reagents used therein. The kits comprise a first container which contains antibodies which were 
produced in response to exposure to an antigen from SARS virus and a second container which 
contains the SARS antigen(s). The kits may be adapted for quantifying the amount of anti-SARS 
antibodies present in a sample of an individual. Such information may be used in determining the 
prognosis of an infected individual. 

35 The present invention relates to methods of identifying individuals exposed to SARS virus, 

or biological samples containing SARSV, by detecting the presence of nucleic acid from SARS 
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viras. Quantification of the amount of S ARS nucleic acid present in a sample of an individual 
may be used in determining the prognosis of an infected individual. The methods utilize 
oligonucleotide probes and/or primers that are similar or complementary in sequence to the 
SARS V genome or transcription or replication products. Preferred probes and primers are 

5 described herein. Also included in the present invention are kits for carrying out the methods of 
detecting the SARSV nucleic acid. 

The invention further includes a method for the treatment and/or prevention of SARS 
through the administration of a therapeutically effective amount of at least one antiviral 
compound from among those described in the US Patents and published intemational patent 

10 applications listed in Table 1 and Table 2. In one embodiment of the method, the antiviral 
compound is a small molecule. In another embodiment, the antiviral compound is a protease 
inhibitor. In a farther embodiment, the antiviral protease inhibitor is a 3C-like protease inhibitor 
and/or a papain-like protease inhibitor. In another embodiment, the antiviral compound is an 
inhibitor of an RNA-dependent RNA polymerase. In another embodiment, a first antiviral 

15 compound which is a protease inhibitor is administered with a second antiviral compound which 
is an RNA-dependent RNA polymerase inhibitor. The invention further provides for the 
administration of a steroidal anti-inflammatory drug in combination with at least one antiviral 
compound, for example, from the antiviral compounds described in the documents listed in 
Table 1 and Table 2. 

20 The invention further provides for a method for the treatment and/or prevention of SARS 

through the administration of a therapeutically effective amount of at least one antiviral 
compound from among those described in the US Patents and published intemational patent 
applications listed in Table 1 and Table 2 by inhalation. In one embodiment of the method, the 
antiviral compound is a small molecule. In another embodiment, the antiviral compound is a 

25 protease inhibitor. In a further embodiment, the antiviral protease inhibitor is a 3C-like protease 
inhibitor and/or a papain-like protease inhibitor. In another embodiment, the antiviral compound 
is an inhibitor of an RNA dependent RNA polymerase. In another embodiment, a first antiviral 
compound which is a protease inhibitor is administered with a second antiviral compound which 
is an RNA-dependent RNA polymerase inhibitor. The invention further provides for the 

30 administration of a steroidal anti-inflanamatory drug in combination with at least one antiviral 
compound, for example, from the antiviral compounds described in the documents listed in 
Table 1 and Table 2 by inhalation. The steroidal anti-inflammatory drug may be administered by 
inhalation for a local effect or administered for systemic absorption such as via an oral or 
intravenous route. 

35 The invention further provides the use of an antiviral compound, as defined above, in the 

manufacture of a medicament for tiie treatment of severe acute respiratory syndrome. 
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The invention further provides for a kit for use by a consumer for the treatment an 
prevention of SARS. Such a kit comprises: (a) a pharmaceutical composition comprising a 
therapeutically effective amount of at least one antiviral compound from among those described 
in the US Patents and published international patent applications listed in Table 1 and Table 2 
5 and a pharmaceutically acceptable carrier, vehicle or diluent; (b) a container for holding the 
pharmaceutical composition; and, optionally; (c) instructions describing a method of using the 
pharmaceutical compositions for the treatment and or the prevention of SARS. The kit may 
optionally contain a plurality of antiviral compounds for the treatment of SARS wherein the anti 
viral compounds are selected from 3C-like protease inhibitors and papain-like protease 

10 inhibitors. Jn a further embodiment, the kit contains an antiviral compound which is an RNA- 
dependent RNA polymerase inhibitor. When the kit comprises more than one antiviral 
compound, the antiviral compounds contained in the kit may be optionally combined in the same 
pharmaceutical composition. 

An additional aspect of the invention provides for the use of at least one of the antiviral 

15 compounds described in the US Patents and published international patent applications listed in 
Table 1 and Table 2 for the manufacture of a medicament for the treatment or prevention of 
SARS. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIGURE 1: Schematic of coronavirus genome organization. 

20 FIGURE 2: Schematic of coronavims ORFla/ORFlb gene products. 

FIGURE 3 (A - C): Alignment of coronavirus polynucleotide sequences for selected genes 
(including nucleocapsid (N), matrix (M), and hemagluttinin-esterase (HE)). 
FIGURE 4 (A - F): Alignment of coronavirus polypeptide sequences (including ORFla/ORFlb, 
nucleocapsid (NP), hemagluttinin-esterase (HE), envelope (Sm or E), matrix (M), and spike (S). 

25 FIGURE 5: Alignment of spike (S) polypeptide sequences, taken from Figure 4, in the region of 
the junction of the SI and the S2 domains, and protease cleavage site for selected coronaviruses. 
FIGURE 6: CDC phylogenetic tree of SARS-CoV strain (Clustalx 1.82, neighbor-joining tree). 
Figure 6A shows coronavirus N protein analysis. Figure 6B shows coronavims S protein 
analysis, and Figure 6C shows coronavirus M protein analysis. 

30 FIGURE 7: Conserved and specific sequence of the SARS virus. Figures 7A-7D show multiple 
sequence alignments (CLUSTAL W 1.82) of the structural proteins of the SARS virus genome 
(7 A: PEP4 Spike protein; 7B: PEP7 small membrane protein; 7C: PEPS matrix glycoprotein; 7D: 
PEP13 nucleocapsid protein), which have counterparts in all or some of the other known 
coronaviruses. Figures 7E-7H show dendrograms reporting the protein distances among the 
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sequences in alignments lA-TD. Labels 229E: human coronavirus; MEV: murine hepatitis viras; 
TGV: transmissible gastroenteritis virus; AIBV: avian infectious bronchitis virus; BOVINE: 
Bovine coronavirus; PEDV: porcine epidemic diarrhea virus. 

FIGURE 8: Alignment of the SUTR of several coronaviruses, to show consensus nucleotide 
5 sequence at the STJTR. 

FIGURE 9: Sequences of preferred primers for amplification of the 5'UTR. F and R denote 
forward and reverse PGR primers, and the numbers indicate nucleotide positions withing.Figure 
8. 

FIGURE 10: Alignment of the STJTR of several coronaviruses, to show consensus nucleotide 
10 sequence at the STJTR. 

FIGURE 1 1 : Sequences of preferred primers for amplification of the STJTR. F and R denote 
forward and reverse PGR primers, and numbers indicate nucleotide positions within Figure 10. 
FIGURE 12: Coiled-coil prediction for SEQ ID NO: 6042, using Coils program (Figure 12A) or 
LearrCoil (Figure 12B). 

15 HGURE 13: Example of insertion of a reporter gene-of-interest at a site between exisiting SARS , 
virus genes. Small nonstructural gene products are not depicted schematically. 

FIGURE 14: Schematic depicting representative examples of SARS virus replicons. Small 
nonstmctural gene products are not depicted schematically. 

FIGURE 15: SARS virus nsp2 proteinase (SCLp) and identification of catalytic and substrate 
20 sites. 

FIGURE 16: alignment of SARS virus nsp2 proteinase (SCLp) with that of avian IB V, MHV, 
and BCoV. Residues in dotted boxes are key residues the substrate sites OP. Y & H); residues in 
solid boxes are catalytic cysteine (C) and histidine (H) residues. 

FIGURE 17: Genome organization of SARS coronavirus. Replicase and structural regions are 
25 shown, along with the predicted products of cleavage within ORFla and ORFlb. The position of 
the 5' RNA leader sequence (L), the 3' poly(A) tract and the ribosomal frame-shift consensus 
between ORFIa and ORFlb are also indicated. Each box represent a protein product. They are 
shaded according to the level of amino acid identity with corresponding proteins of other 
coronaviruses (see also Table 2). The SARS-specific genes are white. Positions of the 9 SARS- 
30 specific six-base IG sequences (5'-ACGAAC-S'; SEQ ID NO 7293) are indicated by arrows. 
FIGURE 18: Genome organization of Coronaviruses representative of group 1 (HCoV-229E, 
accession number. AF304460), group 2 (mouse hepatitis virus MHV, accession number: 
NC_001846), group 3 (avian infectious bronchitis viras AIBV, accession number: NC_001451) 
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and SARS coronavinis. Other completely sequenced coronaviruses used in this study are 
available at the following accession numbers: porcine epidemic diarrhea virus (PEDV), 
AF353511; transmissible gastroenteritis virus (TGV), NC_002306; Bovine coronavinis (BCoV): 
AF220295. Red boxes represent group-specific genes. The position of the leader RNA sequence 
5 and poly(A) tract is also indicated in genomes where they are reported. The position of specific 
IG sequences is indicated by circles of different shades. In the SARS genome^ we also find three 
IG sequences specific for group 2 coronavinis. . 

FIGURE 19: Topological model predicted for the spike protein anchored to the viral membrane. 
Structural and predicted functional domains are indicated. The N-terminal region (SI) is 
.0 predicted to contain the receptor binding domain. Two coiled coil regions within the S2 domain, 
partially superimposed to leucine zipper motifs are presumably involved in oligomerization. The 
hydrophobic domain is responsible for membrane anchoring. 

FIGURE 20: Phylogenetic tree obtained from the multiple sequence alignment of a 922 bp 
internal region of the pol gene from 12 coronaviruses and SARS. Numbers at the nodes represent 

L5 the result of a bootstrap analysis and strongly support the branches. Sequences not available 
within the complete coronavinis genomes have been retrieved from GenBank at the following 
accession numbers: hemagglutinating encephalomyelitis virus of swine (PHEV), AF124988, 
Human OC43 virus (OC43), AF124989, canine coronavinis (CCV), AF124986, feline infectious 
peritonitis virus (FIPV), AF124987, turkey coronavinis (TCV), AF124991, syaloacryoadenitis 

20 vims of rats (SDAV),AF124990. 

FIGURE 21: 21A . Unrooted tree obtained firom the alignment of consensus sequences of the 
group I and group n SI domain of spike proteins (Gl_cons and G2_cons) with those of a group 
3 spike (AIBV) and the spike of SARS virus. The number indicates the result of a bootstrap 
analysis. The sequences used to generate the consensus profile from group 1 are: HcoV-229E, 

25 accession number P15423; porcine epidemic diarrhea virus (PEDV), acc no: NP_598310; 

transmissible gastroenteritis virus (TGV), acc no: NP_058424; Canine coronavinis (CCV), acc 
no: S41453; porcine respiratory virus (PRV), acc no: S24284; feline infectious peritonitis virus 
(FIPV), acc no: VGIH79. The sequences used to generate the consensus profile firom group 2 
are: mouse hepatitis virus (MHV), acc no: NP_045300; Bovine coronavinis (BCoV), acc no: 

30 NP_150077; Human coronavinis OC43, acc no: P36334; hemagglutinating encephalomyelitis 
virus of swine (PHEV), acc no: AAL80031; for group 3, only the sequence of the spike protein 
of avian infectious bronchitis virus (AIBV), acc no: AA034396 was used. 21B : Schematic 
representation of cysteine positions in SI domains of group 1, 2 and 3, compared to the SARS 
spike. Horizontal bars represent the SI amino acid sequences (in the case of SARS and AIBV) or 

35 the consensus profiles (generated from group 1, Gl_cons, and fix>m group 2, G2_cons). The 
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length of the bars are not to scale. Relative cysteine positions are indicated by rectangle bars. 
Only cysteines perfectly conserved within each consensus are reported. lines connect cysteines 
conserved between the SARS SI domain and the consensus sequences as shown. 
FIGURE 22: illustration of a Neisseria Adhesin A protein (NadA). 
5 FIGURE 23: Raw translation from SARS coronavirus genome (reading frame +1). 
FIGURE 24: Raw translation from SARS coronavirus genome (reading frame +3) 
FIGURE 25: lb and Spike open reading frames, separated by *. 
FIGURE 26: SARS growth in vero cells. 

FIGURE 27: chromatogram of the capture step of SARS coronavirus on Matrix Cellufine 
10 Sulfate Superformance 150/10. Analysis was on 100ml coronoavirus harvest. The left Y axis 
shows absorbance at 280nm. The right Y axis shows the gradient (%B). The X axis shows the 
volume (ml). 

FIGURE 28: Silver-stained MCS chromatography fractions. Lanes are: (1) marker, 

(2) coronavirus vero cell harvest; (3) coronavirus vero cell harvest, after 0.65Aim filtration; 
15 (4) flowthrough; (5) wash; (6) 20% peak (virus peak). Lanes were loaded with 1 fJ^g of test 

protein. 

FIGURE 29: Westem Blot of MCS chromatography fractions. Lanes are as described for Fig.28. 
FIGURE 30: Linear density gradient ultracentrifugation, 15-60% sucrose (SW28, 2 hours, 
20000 rpm). The graph shows protein concentration (■) and sucrose concentration (♦). 
20 FIGURE 3 1 : Silver-stained density gradient fractions on NuPage 4-12% Bis-Tris-Ge (Novex), 
reduced conditions, heated for 10 minutes at 70°C. Lanes are: (1) marker; (2) 20% peak MCS; 

(3) density gradient fraction 11; (4) density gradient fraction 12; (5) density gradient fraction 13; 
(6) density gradient fraction 14; (7) density gradient fraction 15; (8) density gradient fraction 16; 
(9) density gradient fraction 17. The bulk of proteins was in fractions 15 to 17. Lanes 2, 8 and 9 

15 were loaded with Ifig protein. 

FIGURE 32: Chromatogram of the Capture Step of SARS coronavirus on MCS. Details are as 
for Figure 27, except that 200ml harvest was used. 

FIGURE 33: Silverstain (left) and Westem Blot (right) of chromatographic fractions. Lanes are 
as described for Figures 28 and 29, except that lane (6) is the 5% peak. Treatment before 
10 SDS-PAGE was at room temperature for 30 minutes. 

FIGURE 34: Density Gradient Ultracentrifugation, 15-40% sucrose (SW28. 2 hours, 20000 
rpm). The graph shows protein concentration (■) and sucrose concentration (♦). 
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FIGURE 35: Silverstain Geft) and Western Blot (right) of Density Gradient Ultracentrifiigation 
fractions on NuPage 4-12% Bis-Tris-Ge (Novex), reduced conditions. Lanes are: (1) marker; (2) 
density gradient fraction 6; (3) density gradient fraction 7; (4) density gradient fraction 8; (5) 
density gradient fraction 9; (6) density gradient fraction 10; (7) density gradient fraction 15. 
Fractions 7-10 (lanes 3-6) contained pure coronavirus proteins. The bulk of impurities was in 
fraction 15 (lane 7). Lanes 2. 8 and 9 were loaded with -Ijttg protein. Treatment before 
SDS-PAGE was at room temperature for 30 minutes. 

HGURE 36: EM pictures of Density Gradient Fractions 8-10. Figure 36A shows fraction 8; 
Figure 36B shows fraction 9; Figure 36C shows fraction 10. 
FIGURE 37: Spike/NadA fusion constructs. 

FIGURES 38 and 39: Results of the expression in Kcoli of SIl, SlL-NadA and SU-NadAAanchar. 
Figure 38 shows SDS-PAGE analysis of total lysates from BL21(DE3)/pET, BL21(DE3)/pEr- 
SIl and BL21(DE3)/pET-SlL-NadAAaBchor. The bands are indicated by an airow, and the three 
lanes are. from left to right: BL21(DE3)/pET; BL21(DE3)/pET-SlL; BL21(DE3)/pET- : 
SlL-NadAwhor. Figure 39 shows (39A) SDS-PAGE and (39B) western blot analyses of total 
lysates from BL21(DE3)/pET, BL21(DE3)/pET-SlL-NadA (grown under un-induced condition) 
and BL21(DE3)/pET-SlL-NadA (grown under induced condition). The bands are indicated by an 
arrow, and lanes are, from left to right: BL21(DE3)/pET; BL21CDE3)/pET-SlL-NadA; 
BL21(DE3)/pET-SlL-NadA. The western blot shows the presence of oligometic forms of the 
protein. 

FIGURE 40: Schematic of SARS Spike clones. 

FIGURE 41: Transient Expression of SARS Spike Proteins (western blot of COS7 cell lysate). 
Each lane of the 4-20% TG SDS gel was loaded with 20/xg ceU lysate (total 1.2mg). The labeling 
antibodies are shown. 

FIGURE 42: Western blot analyses of COS7 ceU lysates on 4% TG SDS gel showing 
oligomerization state of intracellular S molecules. 

FIGURE 43: Western blot analyses of COS7 cell lysates on 4-20% TG SDS gel showing 
Transient Expression of SARS Spike Proteins. Lanes are: (1) mock, AF; (2) mock, DF; (3) nSh, 

AF; (4) nSh. DF; (5) nShATC, AF; (6) nShATC, DF. Each lane was loaded with 5/il of each 
sample, 400^1 total. The blot was labeled with antibody against the His-tagged protein. 

HGURE 44: Western blot analyses of COS7 cell medium on 4-20% TG SDS gel showing 
Transient Expression of SARS Spike Proteins. Truncated spike protein is secreted. Spike 
proteins were purified from the culture medium (from a 10cm plate), first by a ConA column and 
then finally by His-tag Magnetic beads. Each lane was loaded with one third of the material. 
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FIGURE 45: Western blot analyses of COS7 cell lysates on 4-20% TG SDS gel showing 
glycoslation of SARS spike proteins. In the two left-hand blots (lanes 1-5), samples were boiled 
in SDS and p-mercaptoethanol; in the two light-hand blots (lanes 6-1 1), samples were in SDS 
only, with no boiling. Lanes 1-8 were labeled with a monoclonal raised against the His-tag 
5 protein; labes 9-11 were labeled with rabbit anti-SARS antibody. 

FIGURE 46: Effect of SARS spike protein expression on cell viability. 

FIGURE 47: Western blot analyses of COS7 cell lysates on 4% TG SDS gels showing _ 
oligomerization state of intracellular spike molecules. Blots were labeled with anti-His-tag mAb. 
The membrane fraction of COS7 cell lysate was fractionated by a sizing column before loading 
10 the lanes. Fractions 7 to 14 show bands with kDa values of: 71000, 1400; 898, 572, 365,232, 148 
and 99, respectively. 

FIGURE 48: Fractionation of cells into aqueous and detergent fractions. 
FIGURE 49: Schematic of constructs for use in OMV preparation. 
FIGURE 50: SARS HRl and HR2 constructs. 

15 FIGURE 51: Vaccine protection froms SARS in Balb/c mouse model. 

FIGURE 52: Expressed on Spike protein in transfected 293 cell lysates (52A) or COS7 cell 
culture supematants (52B). Proteins were separated on 4-20% TG SDS gels. The label was anti- 
His-tag, except for the right-hand three lanes of 52B, where the label was rabbit anti-SARS 
serum. In Figure 52A, the left-hand three lanes were treated with DTT and were boiled^ but 

20 neither treatment was used for the right-hand three lanes. In Figure 52B, no DTT was used, but 
all lanes were heated to 80°C for 5 minutes. 

FIGURE 53: Western blot of Spike proteins expressed in COS7 cells. Proteins were incubated at 
room temperamre (RT), 80°C or 100°C to check for any effect on molecular weight. FIGURE 54 
shows similar experiments on SARS virions. 

25 FIGURE 55: Results of a pulse chase experiment, showing expression and processing of SARS 
spike protein following infection with alphavirus replicon particles. Cells were treated with or 
without EndoH as shown. 

FIGURE 56: Effect of heating on Spike protein trimers. 

FIGURE 57: Coomassie blue-stained gel of yeast-expressed proteins. Lanes are: 1-See Blue 
30 Standard (lO/il); 2-pAB24 gbl (20jtig); 3-SARS Spike SI c.l gbl (20Mg); 4-SARS Spike SI c.2 
gbl (20Atg); 5-See Blue Standard (10/tl); 6-pAB24 ip (5/il); 7-SARS Spike SI c.l (5/Lil); 8-SARS 
Spike SI c.2 (5jLtl). 

FIGURES 58 to 64: Schematics of preparation of yeast expression constructs. 
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FIGURES 65 to 66: Yeast-expressed sequences for Spike. 

HGURE 67: Western blots showing expression of SARS spike protein from alphavirus replicon 
particles and replicon RNA. Figure 67A was run under non-reducing conditions and at room 
temperature (i.e. no heating), with lanes: (1) VEE/SIN-spike infection; (2) VEE/SIN-GFP 
infection; (3) Replicon-spike RNA transfection; (4) Replicon-GFP RNA transfection. Figure 67B 
was run with SARS virions at different temperatures, as shown. 

HGURE 68: induction of antibody responses in mice. Vaccine groups are: (1) Inactivated SARS 
Virus; (2) Truncated Recombinant Spike Protein; (3) Full length Spike: DNA4-DNA.PLG+ 
Alphavirus; (4) FuU length Spike: Alphavirus particles only. 

FIGURE 69: Binding of human monoclonal antibody S3. 2 to purified truncated Spike protein. 
The X-axis shows antibody concentration, and the Y-axis shows ELJSA absorbance. The 
interpolation result is 2158.13. 

HGURE 70: Geometric mean ELISA titers of antibodies induced by the SARS-CoV spike 
protein deUvered as different vaccines Qeft to right: inactivated virus; 3 fig truncated spike 
protein; ISfig DNA encoding truncated spike protein. 

HGURE 71: Neutrahzation titers after immunization with Oeft) nSdATC protein or (right) DNA 
encoding nSdATC, delivered on PLG. 

HGURE 72: Correlation between the spike antigen binding and neutralizing antibodies 
HGURE 73: Western blot of CHO cell lines expressing Spike protein in full-length form (left) or 
in tiuncated form (right). Proteins were separated by 4-12% SDS-PAGE, with boihng in DTT 
and staining by polyclonal serum. 

HGURE 74: Stouctural components of SARS-CoV spike glycoprotein and expression construct. 
L denotes leader peptide (residues 1-13), TM the transmembrane, and Cy the cytoplasmic tail 
segments. The hexa-His tags are not shown. 

HGURE 75: Western blot analysis of SARS spike proteins expressed in COS7 ceUs. In Figure 
75A, COS7 cells were tiransfected with indicated plasmid constructs and the expressed proteins 
in cell lysates 48 hr post-transfection were analysed by SDS-PAGE (4-20% polyacrylamide) in 
reducing and denaturing condtions, with proteins visualized by anti-histidine Mab. In Figure 
75B, proteins were collected from cell culture medium 48 hr post-ti-ansfection and purified first 
by a ConA column and then by His-tag magnetic beads. Purifed proteins were analysed by SDS- 
PAGE (4-20% polyacrylamide) and were visualized by anti-SARS rabbit serum. 
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FIGURE 76: Endo H sensitivity of C-terminal truncated spike protein (SA) found in cell lysate 
(lanes 1,2) and culture medium (lanes 3,4). Positions of internal SA protein and secreted SA 
protein are marked with arrow heads. 

FIGURE 77: OUgomeric status of the SARS spike protein. Recombinant S protein oligomer in 
5 COS7 cells transfected with the full-length spike construct (nSh). The cell lysates were treated 
with DTT and/or heat as indicated above each lane. The different forms of S protein in treated 
and untreated samples were visualized by SDS-PAGE (4% polyacrylamide) and Western blot 
analysis using anti-histidine MAb. 

FIGURE 78: Effect of heat denaturation on the oligomeric status of recombinant S protein in the 
10 absence of DTT. The COS7 cell lysates were heated before the electrophoresis as indicated and 
the S proteins were visualized as described fogiFigure 77. 

FIGURE 79: Effect of heat denaturation on the oligomeric status of spike protein in SARS virion 
particles. SARS-CoV were grown in Vero cells, purified and solubilized from the virion particles 
by SDS, heat-denatured as indicated and visualized as described in Figure 77, except that rablsit 
15 antiserum against the purified virus was used as a probe. 

FIGURE 80: Analysis of the oUgomeric status of SARS virion spike protein by cross-linking 
experiment. Solubilized SARS virion proteins were treated with DMS. Both untreated (-) and 
DMS treated (+) virion proteins were heat denatured in the absence of DTT and visualized by 
4% PAGE followed by silver staining. 

20 FIGURES 81 & 82: Analysis of the oligomeric status of truncated spike protein by heat 
denaturation. Truncated spike protein within COS7 cell lysates (81) or secreted into culture 
medium (82) were heat denatured as indicated in the absence of DTT and visualized by Western 
blot analysis. 

FIGURE 83: Reactivity of deglycosylated full-length spike oligomer with conformational and 
15 non-confoimational antibody. The full-length recombinant spike oligomer; was partially 

deglycosylated with PNGase F in non denaturating condition and visualized by Western blot 
analysis using anti-histidine Mab (lane 1,2,3) or rabbit antiserum against purified SARS CoV 
(lane 4,5,6). 

FIGURE 84: Localization of expressed SARS spike proteins in fractionated COS7 cell lysate 
0 visualized by western blot. Cells were tiransfected with indicated plasmids and lysed witii 

Dounce homogeniser in hypotonic buffer 48 hr post transfection. Cell lysate was centrifuged to 
obtain soluble cytosol and insoluble membrane fraction that was further solublized by 4% Triton 
X-100. Proteins were heated with SDS at 80 C and analysed by SDS-PAGE (4-20% 
polyacrylamide) in reducing condtion. Proteins were visualized by anti-histidine Mab. The 
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cytosol fractions were loaded in lanes 1, 3, and 5 and the membrane fractions were loaded in 
laiies 2, 4, and 6. 

FIGURE 85: Intracellular and surface expression of recombinant full-length (A,D) or truncated 
(B3) spike protein in COST cells. The cells were fixed at 48 hrs posttransfection and either 
treated with detergent (Cytofix/perm, BD Biosciences) for intracellular immunofluorescence 
(A3,C) or with 2% paraformaldehyde for cell surface inmiunofluorescence observation 
at x40 magnification. Mock transfected cells (C JO were included as controls. 

FIGURES 86-105: SDS-PAGE od E.coli expressed proteins. Tot = total protein; Sol = soluble 
protein fraction. Labels are protein names (Tables 26-30). 

FIGURE 106: Immunofluorescence after administration of vector encoding optimsed N antigen. 
HGURE 107: Immunofluorescence of (A) native and (B) codon-optimsed M sequences. 
FIGURE 108: Immunofluorescence of (A) native and (B) codon optimsed E sequences. 
FIGURES 109- 1 1 1 : Westem blots of Vero cells using rabbit antibodies obtained after . 
immunization with spike proteins expressed in E.coli. 

HGURE 112: Spike protein expression in 293 cells. Lanes: (M) Markers; (1) Mock transfected; 
(2,6) cells expressing nS protein, lysate; (3,7) cells expressing nSdTC protein, lysate; (4,8) cells 
expressing nS protein, supernatant; (5,9) (4) cells expressing nSdTC protein, supernatant. 
Staining antibody: (2 to 5) mouse serum obtained after DNA immunization; (6 to 9) rabbit serum, 
obtained after immunization with whole killed virus. 
HGURE 113: Six reading frames of SEQ ID NO: 9968. 
HGURE 1 14: Six reading frames of SEQ ID NO: 10033. 

HGURE 115: Alignment of bovine coronavims pol lab (top row; SEQ ID NO: 10068), avian 
infectious bronchitis pol lab (second row; SEQ ID NO: 10069), murine hepatitis virus pol lab 
(third row; SEQ ID NO: 10070), SEQ ID NO^: 9997/9998 (fourtii row) and a consensus 
sequence (bottom row; SEQ ID NO: 10071). 

HGURE 116: Schematic of coronavims genome organization. 

HGURE 1 17: Schematic of coronavims ORFla/ORFlb gene products, including region. 
HGURE 1 18: Alignment. 

HGURE 119: Alternative start codons within SEQ ID NO: 10080. 

HGURE 120: Six reading frames of SEQ ID NO: 10084. 

HGURE 121: Alignment of SEQ ID NO: 10033 and SEQ ID NO: 10084. 

HGURE 122: Reading frames in SEQ ID NO: 10084. 
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FIGURE 123: Start codon analysis for SEQ ID NO: 10084. 
FIGURE 124: BLAST analysis of SEQ ID NO: 10210. 

FIGURE 125: Epitope analysis of SEQ ID NO: 10210 by either (13A) Hopp & Woods or (13B) 

Kyte & Doolittle. 
5 FIGURE 126: Reading frames in SEQ ID NO: 10299. 

FIGURE 127: Reading frames in SEQ ID NO: 10505. 

FIGURE 128: Reading frames in SEQ ID NO: 1 1563 . 

FIGURE 129: Reading frames in SEQ ID NO: 10033. 

HGURE 130: Alignment of SEQ ID NO: 9997 and SEQ ID NO: 10033. 
10 FIGURE 131: Reading frames in SEQ ID NO: 10299. 

FIGURE 132: Reading frames in SEQ ID NO: 10505. . 

FIGURE 133: Western Blot of SARS protease purification fractions. 

FIGURE 134: Cleavage of DABCYL-EDANS (a fluorescent tagged peptide with a SARS 

protease cleavage site) by SARS protease at different concentrations. The graph shows 
15 activity/concentration correlations with no protease (4), 0.95 uM protease (■) and 2.86 uM 

protease (•). 

In the event of a discrepancy between a sequence in the sequence listing and a sequence 
in the drawings^ the drawings should take precedence. 

DETAILED DESCRIPTION OF THE INVENTION 

20 The practice of the present invention will employ, unless otherwise indicated, conventional 

methods of chemistry, biochemistry, molecular biology, immunology and pharmacology, within 
the skill of the art. Such techniques are explained fully in the literature. See, e.g.. Remington's 
Pharmaceutical Sciences, Mack Publishing Company, Easton, Pa., 19th Edition (1995); Methods 
In Enzymology (S. Colowick and N. Kaplan, eds.. Academic Press, Inc.); and Handbook of 

25 Experimental Immunology, Vols. I-IV (D.M. Weir and C.C. Blackwell, eds., 1986, Blackwell 
Scientific Publications); Sambrook, et al. Molecular Cloning: A Laboratory Manual (2nd 
Edition, 1989); Handbook of Surface and Colloidal Chemistry (Birdi, K.S. ed., CRC Press, 
1997); Short Protocols in Molecular Biology, 4th ed. (Ausubel et al. eds., 1999, John Wiley & 
Sons); Molecular Biology Techniques: An Intensive Laboratory Course, (Ream et al., eds., 1998, 

30 Academic Press); PCR (Introduction to Biotechniques Series), 2nd ed. (Newton & Graham eds., 
1997, Springer Verlag); Peters and Dalrymple, Fields Virology (2d ed). Fields et al. (eds.), B.N. 
Raven Press, New York, NY. 
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All publications, patents and patent applications cited herein, are hereby incorporated by 
reference in their entireties. 

Severe Acute Respiratory Syndrome (SARS) virus has recently been identified as a new 
viral species. The SARS viral species includes the following isolates. 

- two virus isolates described in Peiris et al "Coronavirus as a possible cause of severe acute 
respiratory syndrome" Lancet published online at 

htp://image.thelancet.com/extras/03art3477web.pdf on April 8 2003, incorporated herein by 
reference in its entirety and the sequences deposited with GenBank at accession number 
AY268070. 

- the isolates and viral sequences described in Drosten et al., "Identification of a Novel 
Coronavirus in Patients with Severe Acute Respiratory Syndrome", New England Journal 
of Medicine, published online at http://vmw.nejm.org on April 10, 2003. 

- the isolates and viral sequences described on the website of the WHO network on March 25 
and 24, 2003. 

- the isolates and viral sequences described in Tsang et al., "A Cluster of Cases of Severe 
Acute Respiratory Syndrome in Hong Kong" New England Journal of Medicine, published 
online at http://www.nejm.oTg on March 31, 2003. 

- the isolates and viral sequences described in Poutanen et al. , "Identification of Severe 
Acute Respiratory Syndrome in Canada" New England Jourrud of Medicine, published 
online at http://www.nejm.org on March 31, 2003. 

As described in the Lancet article, a 646 base pair polynucleotide from the SARS virus has weak 
homology to viruses of the family Comoaviridae. The Lancet article further reports that a 
deduced amino acid sequence (of 215 amino acids) from this sequence has about 57% sequence 
homology to the RNA polymerase of bovine coronavirus and murine hepatitis virus. 
Phylogenetic analysis of the protein sequences are also presented in the Lancet article showing 
that the polymerase sequence is most closely related to the group U coronaviruses. 

Additional SARS viral isolates can be identified, isolated and/or sequenced by virologists 
skilled in the art. Virologists can readily identify new viral isolates as a SARS virus. Criteria 
which a virologist may use to identify new SARS isolates include: sequence homology of the 
new isolate to known SARS viral isolates; similar genomic organization of the new viral isolate 
to known SARS viral isolates; immunological (serologic) similarity or identity with known 
SARS viral isolates; pathology; and similarity of virion morphology with known SARS viral 
isolates; and similarity of infected cell morphology as that caused by known SARS viral isolates 
(visualized, for instance, by electron microscopy). 

Methods for isolating and sequencing SARS viral isolates include the methods described 
by Peiris et al. in the Lancet paper. As reported in the Lancet paper, RNA fmm clinical samples 
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can be reverse transcaibed with random hexamers and cDNA can be amplified with primers 
having sequences of SEQ ID NOS: 6584 & 6585 in the presence of 2.5 mmol/L magnesium 
chloride (94°C for 1 min, 50°C for 1 min, and 72"*C for 1 min). 

Reverse transcription of a viral isolate using random hexamers can be accomplished in an 
5 RT-PCR assay as follows. Virus isolates are propagated on mammalian cells, particularly fetal 
rhesus kidney cells. Total RNA fi»m virus-infected and virus-uninfected fetal rhesus kidney 
cells is then isolated. RNA samples are reverse transcribed with a primer having SEQ ID NO: 
6586. cDNA can be amplified by a primer having SEQ ID NO: 6587. Unique PGR products (in 
size) in the infected cell preparation are then cloned and sequenced, and genetic homology of the 
10 sequence compared with those in GenBank. 

One skilled in the art would be able to identify and clone additional genomic regions using 
a variety of standard cloning techniques, such as, for example, using random primer RT-PCR 
and detection of sequences overlapping one or more of the above sequences, and/or using 
oligonucleotide primers, degenerate primers, based on the sequences provided herein (see 
15 Figures 1-5, Figures 8-11, SEQ ID NOS: 3-20). 

Cloning, sequencing and identification of SARS virus by one skilled in the art can be 
further faciUtated by the use of polynucleotide sequences, particularly RNA polymerase 
sequences, from related Coronaviruses. 

Sequence homology of new viral isolates with the known SARS isolates described above 
20 can be readily determined by one skilled in the art. New SARS isolates may be identified by a 
percent homology of viral nucleotide sequences of 99%, 95%, 92%, 90%, 85%, or 80% 
homology of the new virus to known SARS viral polynucleotide sequences. New SARS isolates 
may also be identified by percent homology of 99%, 95%, 92%, 90%, 85%, or 80% homology of 
: the polypeptides encoded by the polynucleotides of the new virus and the polypeptides encoded 
25 by known SARS virus. 

New SARS isolates may also be identified by a percent homology of 99%, 95%, 92%, 
90%, 85%, or 80% homology of the polynucleotide sequence for specific genomic regions for 
the new virus with the polynucleotide sequence for specific genomic regions of the known SARS 
viruses. Additionally, new SARS isolates may be identified by a percent homology of 99%, 
JO 95%, 92%, 90%, 85%, or 80% homology of die polypeptide sequence encoded by the 

polynucleotide of specific genomic regions of the new SARS virus to the polypeptide sequence 
encoded by the polynucleotides of specific regions of the known SARS virus. These genomic 
regions may include regions (e.g., gene products) which are typically in common among 
numerous coronaviruses, as well as group specific regions (e.g., antigenic groups), such as, for 
15 example, any one of the following genomic regions which could be readily identified by a 
virologist skilled in the art: 5'untranslated region (UTR), leader sequence, ORFla, ORFlb, 
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nonstructural protein 2 (NS2), hemagglutinin-esterase glycoprotein (HE) (also referred to as E3), 
spike glycoprotein (S) (also referred to as E2), ORF3a, ORF3b, ORF3x, nonstructural protein 4 
(NS4), envelope (smaU membrane) protein (E) (also referred to as sM), membrane glycoprotein 
(M) (also referred to as El), ORFSa, ORF5b, nucleocapsid phosphoprctein (N). ORF7a, ORF7b, 
intergenic sequences, 3'UTR, or RNA dependent RNA polymerase (pel). The SARS vims may 
have identifiable genomic regions with one or more the above-identified genomic regions. A 
SARS viral antigen includes a protein encoded by any one of these genomic regions. A SARS 
viral antigen may be a protein or a fragment thereof, which is highly conserved with 
coronavimses. A SARS viral antigen may be a protein or firagment thereof, which is specific to 
the SARS virus (as compared to known comaviruses). (See, Figures 1-5, Figures 8-11, SEQ BO 
NOS:3-20). 

One skilled in the art could also recognize electron microscopy of a SARS virus infected 
manmialian cell. Electron mdcroscopy of SARS infected cells are shown in the Lancet paper. 
As discussed in the paper, electron.microscopy of negative stained (3% potassium phospho- 
tungstate, pH 7.0) ultracentrifuged cell-culture extracts of SARS infected fetal rhesus kidney 
cells show the presence of pleomorphic enveloped virus particles of around 80-90 mn (range 70- 
130 nm) in diameter with surface morphology compatible with a coronavirus (see Lancet paper, . 
Figure 1). Thin-section electron microscopy of infected cells reveals virus particles of 55-90 nm 
diameter within smooth walled vesicles in the cytoplasm (see Lancet paper. Figure 2B) 
Electron microscopy can also be used to observe virus particles at the cell surface. Electron 
microscopy of a human lung biopsy sample depicts similar viral morphology. See Lancet paper 
Figure 2A. 

/. SARS POLYPEPTIDES AND POLYNUCLEOTIDES 

The invention relates to nucleic acids and proteins fi-om SARS virus. Such polynucleotides 
and polypeptides are exemplified further below. 

In one embodiment, the polynucleotides of the invention do not include one of the 
following five primers, disclosed at http://contentMejm.org/cgi/reprint/NEJMoa030781v2.pdfi SEQ ID 
NOS: 6034-38. 

The invention also includes polynucleotide sequences which can be used as probes for 
diagnostic reagents, kits (comprising such reagents) and methods which can be used to diagnose 
or identify the presence or absence of a SARS virus in a biological sample. The invention 
includes a polynucleotide sequence comprising one or more of the primer sequences identified in 
SEQ ID NOS: 21-1020. The invention further includes polynucleotide sequence comprising the 
complement of one or more of the primer sequences identified in SEQ ID NOS: 21-1020. 
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The invention includes a polypeptide sequence comprising an amino acid sequence from 
tile sequence shown in Figure 23. Such amino acid sequences are SEQ ID NOS: 6588-6809. The 
invention includes polypeptides comprising an amino acid sequence having sequence identity to 
tiiese sequences, and tiie invention includes a fragment of a polypeptide comprising one of these 
5 sequences. 

The invention includes a polypeptide comprising an amino acid sequence from the 
sequence shown in Figure 24. Such amino acid sequences are SEQ ID NOS: 6810-7179. The 
invention includes a protein comprising an amino acid sequence having £fequence identity to 
tiiese sequences, and the invention includes a fragment of a protein comprising one of these 
10 sequences. 

The invention includes a protein comprising SEQ ID NO: 6039. The invention includes a 
polypeptide comprising an amino acid sequence having sequence identity to SEQ ID NO: 6039. 
The invention includes a fragment of a polypeptide comprising SEQ ID NO: 6039. The 
invention includes a diagnostic kit comprising a polypeptide comprising SEQ ID NO: 6039, or a 

15 fragment thereof. The invention includes a diagnostic kit comprising a polynucleotide sequence' 
encoding SEQ ID NO: 6039, or a fragment thereof. The invention includes an immunogenic 
composition comprising a polypeptide comprising SEQ ID NO: 6039, or a fragment thereof. 
The invention includes an antibody which recognizes a polypeptide comprising SEQ ID NO: 
6039, or a fragment thereof. SEQ ID NO: 6039 demonstrates functional homology with ORFla 

20 of coronaviruses. 

Predicted transmembrane or hydrophobic regions of SEQ ID NO: 6039 are identified 
below. Although the polyprotein of coronaviruses is proteolytically cleaved into numerous 
smaller proteins, hydrophobic domains in the polyprotein are known to mediate the membrane 
association of the replication complex and to be able to dramatically alter the architecture of host 

25 cell membranes. Accordingly, the hydrophobic domains of the polyprotein are targets for 

genetic mutation to develop attenuated SARS viras vaccines. The hydrophobic domains are also 
targets for small molecule inhibitors of the SARS virus. The hydrophobic domains may also be 
used to generate antibodies specific to those regions to treat or prevent SARS virus infection. 
Predicted Transmenibrane Helices in SEQ XD NOs 6039 

30 

The sequence positions in brackets denominate the core region. 
Only scores above 500 are considered significant. 
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Accordingly, the invention includes a polypeptide comprising a fragment of SEQ ID NO: 
6039, wherein said fragment comprises an amino acid sequence including one or more of the 
hydrophobic transmembrane sequences identified above. The invention includes a polypeptide 
comprising a fragment of SEQ ID NO: 6039 wherein said fragment comprises one or more of the 
following polypeptide sequences of SEQ ID NO: 6039: 473-488, 529-549, 584-606, 773-791, 
2098-2119, 2145-2160, 2206-2224, 2316-2332, 2335-2358, 2373-2390, 2753-2770, 2831-2854, 
2879-2900, 2990-3012, 3024-3042, 3054-3075, 3105-3127, 3438-3455, 3559-3584, 3589-3606, 
3611-3629, 3659-3674, 3756-3777, 473-488, 583-606, 776-791, 2098-2119, 2208-2231, 2309- 
2332, 2342-2368, 2753-2770, 2832-2854, 2990-3005, 3020-3042, 3059-3075, 3105-3127, 3142- 
3162, 3437-3453, 3560-3577, 3591-3606, 3610-3627, 3656-3678, 3710-3738, 3723-3738, and 
3760-3777. Preferably, the fragment comprises one or more of the following polypeptide 
sequences of SEQ ID NO: 6039: 2206-2224, 2316-2332, 2335-2358, 2753-2770, 3024-3042, 
3054-3075, 3105-3127. 3589-3606, 3611-3629, 3756-3777, 2208-2231, 2753-2770, 3020-3042, 
3059-3075, and 3591-3606. Preferably, the fragment comprises one or more of the following 
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polypeptide sequences of SEQ ID NO: 6039: 2206-2224 and 3020-3042. The invention also 
includes polynucleotides encoding each of the polypeptide fragments identified above. 

The invention includes an attenuated SARS virus wheiein said attenuated SARS virus 
contains an addition, deletion or substitution in the polynucleotides encoding for one of the 
5 hydrophobic domains identified above. The invention also includes a method for creating an 
attenuated SARS virus comprising mutating a SARS virus by adding, deleting or substituting the 
viral genome of the SARS virus to alter the coding of one or more of the hydrophobic domains 
of SEQ ID NO: 6039 identified above. 

The invention includes an antibody which specifically identifies one or more of the 
10 hydrophobic regions of SEQ ID NO: 6039 identified above. The invention includes a small 

molecule which binds to, interferes with the hydrophobicity of or otherwise disrupts one or more 
of the hydrophobic regions of SEQ ID NO: 6039 identified above. 

Predicted N-glycosylation sites of SEQ ID NO: 6039 are identified in the chart below. 
Prediction of N-glycosylatioa sifcee in SEQ id nOs 6039 
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Accordingly, the invention comprises a fragment of SEQ ID NO: 6039 wherein said 
fragment comprises an amino acid sequence which includes one or more of the N-glycosylation 
sites identified above. Preferably, the fragment comprises one or more sequences selected from 
the group consisting of SEQ ID NOS: 7180-7187 & 9764-9765. Preferably, the fiagment 
comprises the amino acid sequence NSSN (SEQ ID NO: 7185). 

The invention includes a polypeptide comprising a firagment of SEQ ID NO: 6039 wherein 
said firagment does not include one or more of the glycosylation sites identified above. The 
invention also includes a polynucleotide encoding such a polypeptide. 

T-epitopes for SEQ ID NO: 6039 are identified in Table 13. The invention includes a 
polypeptide for use as an antigen, wherein the polypeptide comprises: (a) an amino acid 
sequence selected from the group consisting of the T-epitope sequences identified as SEQ ID 
NOS: 7400-7639; (b) an amino acid sequence having sequence identity to an amino acid 
sequence of (a). The invention further comprising a polynucleotide sequence encoding the 
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polypeptides Of (a) or (b). The invention further comprising a method of expression or delivery 
of such polynucleotides through viral vectors and/or viral particles. The invention further 
comprises a polypeptide comprising two or more of the T-epitope sequences identified as SEQ 
ID NOS: 7400-7639, or a polynucleotide encoding such a polypeptide. 

The use as an antigen is preferably a use: (1) as a T-cell antigen; (2) for generating a 
complex between a class I MHC protein {e.g. a class I HLA) and a fragment of said antigen; (3) 
as an antigen for raising a cell-mediated immune response; and/or (4) as an antigen for raising a 
CTL response. The use preferably protects or treats disease and/or infection caused by a S ARS 
virus. 

The invention provides the use of a polypeptide in the manufacture of a medicament for 
immunising a mammal (typically a hmnan) against SARS viral infection wheiein the polypeptide 
is as defined above. 

The invention provides a method of raising an immune response in a mammal (typically a 
human), comprising the step of administering to the mammal a polypeptide as defined above, 
wherein said immune response is a cell-mediated immune response and, preferably, a CTL 
response. The immune response is preferably protective or therapeutic. 

The ORFla and ORFlb sequences of coronaviruses are typically translated as a single 
ORFlab polyprotein. Slippage of the ribosome during translation generates an a-1 frameshift. 
One region of such slippage is illustrated below: 

+3 G F Y T - K H S L Y R L R N V E R L W L - L 

+3 P T P R T L D A V C G C I N V F K R V C G V~ / ' 

gcagcccgtcttacaccgtgcggcacaggcactagtactg (SEQ ID NO: 7224) 
^3 A A R I. T r"" n"" L V I. (SEQ ID NOS: 7225-7226) 

+JAARI.TPCGTGTST (SEQ ID NOS: 7227-7229) 

Which would generate the following translational slippage (SEQ ID NOS- 7230-7231)- 

Accordingly, the invention includes a polypeptide comprising SEQ ID NO: 7232. The 
invention includes a polypeptide comprising an amino acid sequence having sequence identity to 
SEQ ID NO: 7232. The invention includes a fragment of a polypeptide comprising SEQ ID NO: 
7232 The invention includes a diagnostic kit comprising a polypeptide comprising SEQ ID NO: 
7232 or a fragment thereof. The invention includes a diagnostic kit comprising a polynucleotide 
sequence encoding SEQ ID NO: 7232 or a firagment thereof. The invention includes an 
immunogenic composition comprising a polypeptide comprising SEQ ID NO: 7232 or a 
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£ragment thereof. The invention includes an antibody which recognizes a polypeptide 
comprising SEQ ID NO: 7232 or a fragment thereof. 

The invention also includes a polypeptide comprising amino acid sequence X1-X2-X3, 
where Xi is SEQ ID NO: 7233, X2 is from one to ten amino acids, and X3 is SEQ ID NO: 7234. 
5 X2 can comprise any sequence of one to ten amino acids (SEQ ID NOS: 7235-7244) but, in 

preferred embodiments, X2 is selected from the group consisting of F, FL, FLN, FLNR (SEQ ID 
NO: 7245), FLNRV (SEQ ID NO: 7246) and FLNRVC (SEQ ID NO: 7247). Preferably, X2 is 
SEQ ID NO: 7247. These preferred embodiments are shown as SEQ ID NOS: 7248-7253. 

The invention includes a polypeptide comprising an amino acid sequence having sequence 

10 identity to said amino acid sequences X1-X2-X3. The invention includes a fragment of a 
polypeptide comprising said amino acid sequences X1-X2-X3. The invention includes a 
diagnostic kit comprising a polypeptide comprising said amino acid sequences X1-X2-X3 or a 
fragment thereof. The invention includes a diagnostic kit comprising a polynucleotide sequence 
encoding said amino acid sequences X1-X2-X3 or a fragment thereof. The invention includes an 

15 immunogenic composition comprising a polypeptide comprising said amino acid sequences 
X1-X2-X3 or a fragment thereof. The invention includes an antibody which recognizes a 
polypeptide comprising said amino acid sequences X1-X2-X3 or a fragment thereof. 

The amino acid sequences X1-X2-X3 (i.e. SEQ ID NOS: 7235-7244) demonstrate 
functional homology with the polyprotein of murine hepatitis virus. This polyprotein is cleaved 

20 to produce multiple proteins. Proteins which can be generated from the X1-X2-X3 polyprotein, 
where X2 is six amino acids (SEQ ID NO: 7240) are listed below. 
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5903-6429 
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6430-6775 


NsplS 
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The invention includes a fragment of the amino acid sequence X1-X2-X3 (i.e. SEQ ID 
NOS: 7235-7244) wherein the fragment comprises one of the polypeptide sequences identified in 
25 the above table. The invention further includes a fragment of the amino acid sequence X1-X2-X3 
wherein said fragment comprises a polypeptide sequence which has a serine at its N-terminus 
and a glutamine at its C-terminus. The invention further includes a fragment of the amino acid 
sequence X1-X2-X3 wherein said fragment comprises a polypeptide sequence which has an 
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Alanine at its N-teiminus and a glutamine at its C-terminus. The invention further includes a 
fragment of the amino acid sequence X1-X2-X3 wherein said fragment comprises a polypeptide 
sequence which has a Asparagine at its N-terminus and a glutamine at its C-terminus. The 
invention further includes a fragment of the amino acid sequence X1-X2-X3 wherein said 
5 fragment comprises a Cysteine at its N-terminus and a Glutamine at its C-terminus. Each of the 
fragments identified above can be used in fusion proteins. 

The invention includes a diagnostic kit comprising a polypeptide comprising at least one of 
the fragments of the amino acid sequence X1-X2-X3 (i.e. SEQ ID NOS: 7235-7244) identified in 
the above paragraph. The invention includes a diagnostic kit comprising a polynucleotide 

10 sequence encoding at least one of the fragments of the amino acid sequence X1-X2-X3 identified 
in the above paragraph. The invention includes an inrnnmogenic composition comprising a 
polypeptide comprising at least one of the fragments of the amino acid sequence Xi-X2-X3 
identified in the above paragraph. The invention includes an antibody which recognizes a 
polypeptide comprising at least one of the fragments of the amino acid sequence X1-X2-X3 

15 identified in the above paragraph. 

Predicted N-glycosylation sites of the amino acid sequence X1-X2-X3 when X2 is six amino 
acids are identified at the asparagines located at the following amino acid positions 48; 389; 556; 
916; 1628; 1696; 1899; 2079; 2249; 2252; 2507; 2685; 3303; 3373; 3382; 3720; 4150; 4233; 
4240; 5016; 5280; 5403; 5558; 5650; 5905; 6031; 6130; 6474; 6918; 6973. Accordingly, the 

20 invention comprises a firagment of SEQ ID NO: 7239 wherein said fragment is at least ten amino 
acids and wherein said fragment comprises one or more of the asparagines from the amino acid 
positions of SEQ ID NO: 7239 selected from the group consistmg of 8; 389; 556; 916; 1628; 
1696; 1899; 2079; 2249; 2252; 2507; 2685; 3303; 3373; 3382; 3720; 4150; 4233; 4240; 5016; 
5280; 5403; 5558; 5650; 5905; 6031; 6130; 6474; 6918; and 6973. 

25 A zinc binding region 2 site within SEQ ID NOS: 7235-7244 is identified at amino acid 

residues 2102-21 12 (SEQ ID NO: 7254 HGIAAINSVPW). The polypeptide of SEQ ID NOS: 
7235-7244 will be processed by the SARS virus into multiple peptides. This zinc binding region 
falls within the nspl region of the polypeptide. SEQ ID NO: 7254 is a target for screening of 
chemical inhibitors to the SARS virus. The invention includes a polypeptide comprises SEQ ID 

30 NO: 7254. The invention includes a polynucleotide encoding the polypeptide sequence of SEQ 
ID NO: 7254. The invention includes a method of screening SEQ ID NO: 7254 for an inhibitor. 
The invention includes the recombinant expression of SEQ ID NO: 7254 in a host cell. The 
invention includes a fragment of SEQ ID NOS: 7235-7244, wherein said fragment comprises 
SEQ ID NO: 7254. The invention includes a polypeptide comprising SEQ ID NO: 7254 wherein 

35 said polypeptide is complexed with a zinc ion. The invention includes a small molecule which 
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prevents a zinc ion from complexing with the polypeptide of SEQ ID NO: 7254. The invention 
includes a fusion protein wherein said fusion protein comprises SEQ ID NO: 7254. 

The polyprotein encoded by the SARS virus will contain at least two protease domains: a 
papain-like cystein protease (PLP) and a chymotrypsin-picomavirus 3C-like protease (3CLp). 

5 (There may be more than one copy of the PLP domain). These proteases function to cleave the 
polyprotein into multiple smaller proteins. The 3C-like protease, also known as the "main 
protease" or Mpix), is itself cleaved from the polyprotein by its own autoprotease activity. See 
generally, Chapter 35 of Fields Virology (2nd ed). Fields et al (eds.), B.N. Raven Press, New 
York, NY, and Anand et al, EMBO Journal (2002) 21 (13): 3213-3224. This 3CLp generally 

10 corresponds with the Nsp2 region identified above. 

The SARS virus 3CLp protein is further characterized by SEQ ID NO: 6569 (also SEQ ID 
NO: 9769), as shown in FIGURE 15. 

FIGURE 16 also illustrates the SARS virus 3CLp, in allignment with the 3CLp of avian 
infectious bronchitis (IBV; SEQ ID NO: 6570), mouse hepatitis virus (MHV; SEQ ID NO: 

15 6571), and bovine coronavirus (BCoV; SEQ ID NO: 6572). Accordingly, the invention includes 
a polypeptide sequence comprising SEQ ID NO: 6569, or a fragment thereof, or a polypeptide 
sequence having sequence identity thereto. The invention further includes a polynucleotide 
sequence encoding SEQ ID NO: 6569, or a fragment thereof. The invention includes a 
polynucleotide sequence encoding a polypeptide sequence having sequence identity to SEQ ID 

20 NO: 6569. 

The invention further includes a method of screening for an inhibitor of the SARS virus 
3CLp protein. In one embodiment, the invention includes a method of screening for an inhibitor 
of SEQ ID NO: 6569. The invention includes a method of recombinantly expressing the SARS 
virus 3CLp protein in a host cell. The invention includes a method of recombinantly expressing 

25 a polypeptide sequence comprising SEQ ID NO: 6569 or an enzymatically active fragment 
thereof or a polypeptide sequence having sequence identity thereto. The invention includes a 
small molecule which inhibits or reduces the proteolytic activity of the SARS virus 3CLp 
protein. The invention includes a small molecule which inhibits or reduced the proteolytic 
activity of the polypeptide comprising SEQ ID NO: 6569. 

30 Catalytic residues of the SARS virus 3CLp are identified in FIGURE 15 and 16. 

Specifically, a catalytic histidine and a catalytic cysteine are identified. Such catalytic sites are 
targets for small molecules which could inhibit or reduce the protease activity of 3CLp. 
Accordingly, the invention includes a polypeptide comprising a fragment of SEQ ID NO: 6569, 
wherein said fragment comprises at least one catalytic site. Preferably, the catalytic site is 

35 selected from the group consisting of the indicated catalytic histidine and the catalytic cysteine in 
FIGURE 15 and 16. The invention includes a polynucleotide encoding a polypeptide, wherein 
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said polypeptide comprises a fragment of SEQ ID NO: 6569, wherein said firagment comprises at 
least one catalytic site. Preferably, the catalytic site is selected from the group consisting of the 
indicated catalytic histidine and the catalytic cysteine. 

The invention further includes a method of screening a compound library to identify a 
small molecule which inhbits a catalytic site of a SARS virus 3CLp. Preferably, the 3CLp 
comprises SEQ ID NO: 6569, or a fragment thereof, or a sequence having sequence identity 
thereto. The catalytic site is preferably selected from the group consisting of the indicated 
catalytic histidine and the catalytic cysteine in FIGURE 15 and 16, 

The invention includes a small molecule which inhibits the catalytic site of a SARS virus 
3CLp. Preferably, the 3CLp comprises SEQ ID NO: 6569, or a fragment thereof, or a sequence 
having sequence identity thereto. The catalytic site is preferably selected from the group 
consisting of the indicated catalytic histidine and the catalytic cysteine in FIGURE 15 and 16. 

Residues of the substrate site of the SARS virus 3CLp are identified in FIGURE 15 aijd 16. 
Specifically, a substrate site is indicated at a phenylalanine, a tyrosine and a histidine. Such 
substrate sites are targets for small molecules which could inhibit or reduce the protease activity 
of 3CLp. Accordingly, the invention includes a polypeptide comprising a fragment of SEQ ID 
NO: 6569, wherein said fragment comprises at least one substrate site. Preferably, the substrate 
site is selected from the group consisting of the indicated substrate phenylalanine, tyrosine and 
histidine in FIGURE 15 and 16. The invention includes a polynucleotide encoding a 
polypeptide, wherein said polypeptide comprises a fragment of SEQ ID NO: 6569, wherein said 
firagment comprises at least one substrate site. Preferably, the substrate site is selected from the 
group consisting of the indicated substrate phenylalanine, tyrosine and histidine in FIGURE 15 
and 16. 

The invention further includes a method of screening a compound library to identify a 
small molecule which blocks a substrate site of a SARS virus 3CLp. Preferably, the 3CLp 
comprises SEQ ID NO: 6569, or a fragment thereof, or a sequence having sequence identity 
thereto. The substrate site is preferably selected firom the group consisting of the indicated 
substrate phenylalanine, tyrosine and histidine in FIGURE 15 and 16. 

The invention includes a small molecule which inhibits the substrate site of a SARS virus 
3CLp. Preferably, the 3CLp comprises SEQ ID NO: 6569, or a fragment thereof, or a sequence 
having sequence identity thereto. The substrate site is preferably selected from the group 
consisting of the indicated substrate phenylalanine, tyrosine and histidine in FIGURE 15 and 16. 

The invention further includes a diagnostic kit comprising a polynucleotide encoding a 
SARS virus 3CLp or a fragment thereof. Preferably, the SARS vims 3CLp comprising SEQ ID 
NO: 6569 or a fragment thereof or a polypeptide sequence having sequence identity thereto. 
Preferably, the firagment comprising one or more sites selected fix)m the group consisting of a 
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catalytic site and a substrate site. Preferably, the catalytic site is selected from the group 
consisting of one or more of the sites identified in FIGURE 15 and 16. Preferably, the substrate 
site is selected from the group consisting of one or more of the sites identified in FIGURE 15 
and 16. 

5 The invention further comprises a diagnostic kit comprising an antibody specific to a 

SARS virus 3CLp or a fragment thereof. Preferably, the antibody is specific to the polypeptide 
comprising SEQ ID NO: 6569 or a fragment thereof or a polypeptide sequence having sequence 
identity thereto. Preferably, the antibody is specific to one or more sites of a SARS virus 3CLp 
selected from the group consisting of a catalytic site and a substrate site. Preferably, the catalytic 

10 site is selected from the group consisting of one or more of the sites identified in FIGURE 15 
■ and 16. Preferably, the substrate site is selected from the group consisting of one or more of the 
sites identified in FIGURE 15 and 16. 

The invention includes a polypeptide comprising an amino acid sequence from the 
sequence shown in Figure 25. The two amino acid sequences within Figure 25, separated by a *, ■ 

15 are SEQ ID NOS: 7188 & 7189. The invention ii^cludes a polypeptide comprising an amino acid 
sequence having sequence identity to the figure 25 translation. The invention includes a 
fragment of a polypeptide comprising the figure 25 sequence. The invention includes a 
diagnostic kit comprising a polypeptide comprising the figure 25 translation, or a fragment 
thereof. The invention includes a diagnostic kit comprising a polynucleotide sequence encoding ■ 

20 the figure 25 translation, or a fragment thereof. The invention includes an immunogenic 
composition comprising a polypeptide comprising the figure 25 translation, or a fragment 
thereof. The invention includes an antibody which recognizes a polypeptide comprising the 
figure 25 sequence, or a fragment thereof. The figure 25 sequence demonstrates functional 
homology with ORFlb of coronaviruses. 

25 SEQ ID NO: 7188 is an open reading frame within Figure 25. The invention includes a 

polypeptide comprising SEQ ID NO: 7188. The invention includes a polypeptide comprising an 
amino acid sequence having sequence identity to SEQ ID NO: 7188. The invention includes a 
fiagment of a polypeptide comprising SEQ ID NO: 7188. The invention includes a diagnostic 
kit comprising a polypeptide comprising SEQ ID NO: 7188, or a fragment thereof. The 

30 invention includes a diagnostic kit comprising a polynucleotide sequence encoding SEQ ED NO: 
7188, or a fragment thereof. The invention includes an inmiuriogenic composition comprising a 
polypeptide comprising SEQ ID NO: 7188, or a fragment thereof. The invention includes an 
antibody which recognizes a polypeptide comprising SEQ ID NO: 7188, or a fragment thereof. 
SEQ ID NO: 7190 is an open reading frame within SEQ ID NO: 7188. The invention 

35 includes a polypeptide comprising SEQ ID NO: 7 190, a fragment thereof or a polypeptide 

having sequence identity thereto. The invention further includes a polynucleotide encoding SEQ 
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ID NO: 7190, a fragment thereof or a polypeptide sequence having sequence identity thereto. 
An example of a polynucleotide encoding SEQ ID NO: 7190 is given as SEQ ID NO: 7191. 

SEQ ID NO: 7188 also contains an open reading frame comprising SEQ ID NO: 6042. 
The invention includes a polypeptide comprising SEQ ID NO: 6042. The invention includes a 
polypeptide comprising an amino acid sequence having sequence identity to SEQ ID NO: 6042. 
The invention includes a fragment of a polypeptide comprising SEQ ID NO: 6042. The 
invention includes a diagnostic kit comprising a polypeptide comprising SEQ ID NO: 6042, or a 
fragment thereof. The invention includes a diagnostic kit comprising a polynucleotide sequence 
encoding SEQ ID NO: 6042, or a fragment thereof. The invention includes an immunogenic 
composition comprising a polypeptide comprising SEQ ID NO: 6042, or a fragment thereof. 
The invention includes an antibody which recognizes a polypeptide comprising SEQ ID NO: 
6042, or a fragment thereof. SEQ ID NO: 6042 demonstrates functional homology to a 
coronavirus spike protein. 

Predicted transmembrane regions of SEQ ID NO: 6042 are identified below. 

Predicted, Transmembrane helices o£ SEQ ZD NO: 6042 



The sequence positions in brackets denominate the core region. 
Only scores above 500 are considered significant. 

Inside to outside helices : 18 found Outside to inside helices : 13 found 





from 


to 


score 


center 




from 




to 


score 


center 


1 


( 1) 16 


{ 16) 


959 


9 


1 


( 1) 


17 


{ 17) 


684 


10 


233 


( 237) 257 


( 252) 


905 


244 


222 


{ 222) 


240 


{ 237) 


238 


229 


345 


( 347) 364 


( 361) 


490 


354 


244 


{ 247) 


264 


( 264) 


613 


254 


345 


( 354) 369 


( 369) 


420 


362 


349 


( 355) 


369 


( 369) 


314 


362 


497 


( 497) 513 


( 513) 


239 


506 


496 


( 496) 


511 


( 511) 


488 


503 


573 


( 573) 588 


( 588) 


811 


580 


573 


( 573) 


591 


( 591) 


712 


581 


645 


( 648) 666 


( 663) 


302 


656 


650 


( 652) 


666 


( 666) 


474 


659 


690 


( 696) 714 


( 711) 


428 


704 


674 


( 679) 


702 


( 696) 


190 


686 


857 


( 860) 882 


( 874) 


1508 


867 


691 


( 696) 


713 


{ 711) 


210 


704 


1031 


{1031)1046 


(1046) 


446 


1039 


866 


( 868) 


886 


( 886) 


1172 


876 


1199 


(1203)1219 


(1217) 


2667 


1210 


1198 


(1201)1215 


(1215) 


3221 


1208 



SEQ ID NO: 6042, the spike protein, is a surface exposed polypeptide. Recombinant 
expression of a protein can be hindered by hydrophobic transmembrane regions. Accordingly, 
the invention includes a polypeptide comprising SEQ ID NO: 6042 wherein one or more of the 
hydrophobic regions identified above is removed. The invention further includes a 
polynucleotide encoding such a polypeptide. The invention includes recombinantly expressing 
the protein in a host cell. Primers for amplifying the gene for spike protein and fragments 
thereof, such as fragments encoding the soluble ectodomain, include SEQ ID NOS: 9753-9763 
(Xiao et al (2003) Biochem Biophys Res Comm. 312:1159-1164). 

Further characterization of SEQ ID NO: 6042 is set forth below. 

PSOR3* Prediction o£ Protein Xiocalization Sites 
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version 6.4 (WWW) 

SEQ ID NO: 6042 - 1255 Residues 
Species classification: 4 

*** Reasoning Step: 1 

Preliminary Calculation of ALOM (threshold: 0.5) 
count : 2 

Position of the most N-terminal TMS: 496 at i=2 
MTOP: membrane topology (Hartmann et al.) 

I (middle): 503 Charge diffirence{C-N) : 1.0 
McG: Examining signal sequence (McGeoch) _. _ 

Dength of UR: 13 

Peak Value of UR: 3.28 

Net Charge of CR: 0 

Discriminant Score: 8.66 
GvH: Examining signal sequence (von Heijne) 

Signal Score (-3.5): 5.94 

Possible cleavage site: 13 
»> SeCTis to have a cleavable N-term signal seq. 
Amino Acid Composition of Predicted Mature Form: 

calculated from 14 
ALOM new cnt: 1 ** thrshld changed to -2 
Cleavable signal was detected in ALOM? : OB 
ALOM: finding transmembrane regions (Klein et al.) 

count: 1 value: -12.2 6 threshold: -2.0 

INTEGRAL Likelihood =-12.26 Transmenbrane 1202-1218 (1194-1228)' 

PERIPHERAL Likelihood =0.16 

modified ALOM score: 2.55 , 
»> Seems to be a Type la meinbrane protein 

The cytoplasmic tail is from 1219 to 1255 (37 Residues) 
Rule: vesicular pathway 
Rule: vesicular pathway 
Rule: vesicular pathway 
(14) or uncleavable? 

Gavel: Examining the bovmdary of mitochondrial targeting seq. 
motif at: 14 

Uncleavable? Ipos set to: 24 • 
Discrimination of mitochondrial target seq. : 

positive (2.18) 
Rule: vesicular pathway 
Rule: vesicular pathway 
Rule: vesicular pathway 

*** Reasoning Step; 2 

KDEL Count : 0 

Checking apolar signal for intramitochondrial sorting 

(Gavel position 24) from: 1 to: 10 Score: 8.0 
SKL motif (signal for peroxisomal protein) : 

pos: 964(1255), count: 1 SRL 

SKL score (peroxisome) : 0.1 
Amino Acid Composition Tendency for Peroxisome: 1.37 

AAC not from the N-term. , score modified 
Peroxisomal proteins? Status; notclr 

AAC score (peroxisome) : 0.079 
Amino Acid Composition tendency for lysosomal proteins 

score: 0.39 Status: notclr 
GY motif in the tail of typela? (lysosomal) 
Checking the amoxont of Basic Residues (nucleus) 
Checking the 4 residue pattern for Nuclear Targeting 
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Checking the 7 residue pattern for Nuclear Targeting 
Checking the Robbins & Dincfwall consensus (nucleus) 
Checking the RNA binding motif (nucleus or cytoplasm) 
Nuclear Signal Status: negative ( 0.00) 
Type la is favored for plasma memb. proteins 
Checking the NPXY motif.. 
Checking the YXRF motif . . 
Checking N-myristoylation. . 

Final Results 

plasma menOsrouie Certainty= 0 .460 (Affirmative) < suco 

microbody (peroxisome) Certaiaty= 0 .171(Af f izmative) < suco 

endoplasmic reticul-um (membrane) Certainty= O.lOO(Affixnative) < suco 

endoplasmic reticulum (lumen) Certainty= 0 .100 (Affirmative) < suco 

SEQ ID NO: 6042 appears to have a N-terminus signaling region, followed by a surface 
exposed region, followed by a transmembrane region followed by a C-terminus cytoplasmic 
domain region. Accordingly, the invention includes an immunogenic, surface exposed fragment 
of SEQ ID NO: 6042. Preferably, said fragment comprises an amino acid sequence which does 
not include the last 50 amino acids of the C-terminus of SEQ ID NO: 6042. Preferably, said 
fragment comprises an amino acid sequence which does not include the last 70 amino acids of 
the C-terminus of SEQ ID NO: 6042. Preferably, said fragment does not include a transdomain • 
region of SEQ ID NO: 6042. Preferably, said fragment does not include a C-terminus 
cytoplasmic domain of SEQ ID NO: 6042. Preferably, said fragment does not include a N- 
terminus signal sequence. Preferably, said fragment does not include amino acids 1-10 of the N- 
terminus of SEQ ID NO: 6042. Preferably, said fragment does not include amino acids 1-14 of 
the N-terminus of SEQ ID NO: 6042. Two oligopeptide fragments of SEQ ID NO: 6042 that are 
able to elicit anti-spike antibodies are SEQ ID NOS: 7398 & 7399, as described (with additional 
C-terminus cysteines) by Xiao et al. (2003) Biochem Biophys Res Comm 312:1159-1164. 
C-terminal truncations of spike protein, with removal of part of the cytoplasmic region, or 
removal upto and including the transmembrane region, are described by Yang et al (2004) 
Nature 428:561-564. 

A variant of SEQ ID NO: 6042 that is included within the invention is SEQ ID NO: 9962. 
Compared to SEQ ID NO: 6042, this sequence has Ser at residue 581 instead of Ala, and has Phe 
at residue 1152 instead of Leu. 

The spike protein of coronaviruses may be cleaved into two separate chains into SI and S2. 
The chains may remain associated together to form a dimer or a trimer. Accordingly, the 
invention includes a polypeptide comprising SEQ ID NO: 6042 wherein said polypeptide has 
been cleaved into SI and S2 domains. The invention further includes a polypeptide comprising 
SEQ ID NO: 6042 wherein amino acids 1-10, preferably amino acids 1-14 of the N-terminus are 
removed and further wherein SEQ ID NO: 6042 is cleaved into SI and S2 domains. Preferably 
the polypeptide is in the form of a trimer. 
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The spike protein appears to form an alpha-helical structure in the transmembrane region 
of the protein, preferably in the S2 domain. This alpha-helical structure is thought to associate 
with at least two additional spike proteins to form a trimer. Helical or coiled regions of the spike 
protein are identified below. Predicted coiled-coils of SEQ ID NO: 6042 (spike protein) are at 
5 amino acids 900-1005 and 1151-1185 (see Figure 12). 

Accordingly, the invention comprises a polypeptide sequence comprising a fragment of 
SEQ ID NO: 6042 wherein said fragment includes a coiled region of SEQ ID NO: 6042. Said 
fragment preferably includes the amino acid sequences selected from the group consisting of 
amino acid positions 900 to 1005 and amino acid positions 1151 to 1185 of SEQ ID NO: 6042. 

10 The invention comprises a polypeptide sequence comprising a fragment of SEQ ID NO: 6042, 
wherein said fragment does not include a coiled region of SEQ ID NO: 6042. Said fragment 
preferably includes the amino acid sequences selected from the group consisting of amino acid 
positions 900 to 1005 and amino acid positions 1151 and 1185 of SEQ ID NO: 6042. 
The spike protein is believed to play an integral role in fusion and infection of 

15 Coronaviruses with mammalian host cells. Analysis of coronavirus spike proteins as well as 
similar surface proteins in other viruses has identified at least two structural motifs, typically 
located within the S2 domain, associated with this fusion event: heptad repeats (HR) and 
membrane fusion peptides. 

At least two 4,3 hydrophobic heptad repeat (HR) domains are typically found in the 

20 ectodomain of the S2 domain of Coronaviruses. One heptad repeat region (HRl) is typically 

located adjacent to a fusion peptide while a second heptad region (HR2) is typically located near 
the C-terminus of the S2 domain, close to the transmembrane anchor. Heptad repeats are 
characteristic of coiled-coil structures and the heptad repeats found in viral surface proteins (such 
as coronavirus spike protein) are thought to form bundled heUx structures which are involved in 

25 viral entry. See Bosch et al, J. Virology (2003) 77:8801-881 1 (Figure IB of this reference 
illustrates an alignment of the HRl and HR2 regions of five coronaviruses along with SARS, 
annotated "HCov-SARS"). 

Heptad repeats generally contain a repeating structure of seven amino acids, designated a- 
b-c-d-e-f-gy where hydrophobic sidechains of residues a and d typically form an apolar stripe, 

30 and electrostatic interactions are found in residues e and g. Position a is most frequently Leu, He 
or Ala and position d is usually Leu or Ala. Residues e and g are often Glu or Gin, with Arg and 
Lys also prominent at position g. Charged residues are common to positions b, c and/ as these 
residues may be in contact with solvent. Exceptions to these general parameters are known. For 
instance Pro residues are sometimes found within the heptad. 

35 The HRl and HR2 sequences of an MHV strain have been postulated to assemble into a 

thermostable, oligomeric, alphahelical rold-like complex, with the HRl and HR2 helices 
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oriented in an antiparallel manner. Id. In this same study, HR2 was asserted to be a strong 
inhitibor of both virus entry into the cell and cell-cell fusion. 

HRl and HR2 sequences have been identified in the SARS virus genome. The SARS virus 
HRl region comprises approximately amino acids 879 to 1005 of SEQ ID NO: 6042 or 
5 fragments thereof capable of forming at least one alpha-helical turn. Preferably, said fragments 
comprise at least 7 {e.g., at least 14. 21, 28, 35, 42, 49 or 56) amino acid residues. SEQ ID NO: 
7192, includes amino acids 879 to 1005 of SEQ ID NO: 6042. 

A preferred fragment of HRl comprises amino acid residues 879 to 980 of SEQ ID NO: 
6042. This preferred firagment is SEQ ID NO: 7193. 
10 Another preferred fragment of HRl comprises amino acid residues 901 to 1005 of SEQ 

ID NO: 6042. This preferred fragment is SEQ ID NO: 7194. 

The SARS virus HR2 region comprises approximately amino acids 1144 to 1201 of SEQ 
ID NO: 6042, or fragments thereof capable of forming at least one alpha-helical turn. Preferably, 
said fragments comprise at least 7 (e.g., at least 14, 21, 28, 35, 42, 49 or 56) amino acid residues. 
15 SEQ ID NO: 7195 includes amino acids 1 144 to 1201. A preferred fragment of HR2 comprises 
amino acids 1144 to 1195 of SEQ ID NO: 6042. This preferred fragment is SEQ ID NO: 7196. 

Membrane Fusion peptides sequences within the spike protein are also believed to 
participate in fusion (and infection) of the virus with a host cell. Fusion peptides generally 
comprise about 16 to 26 amino acid residues which are conserved within viral families. These 
20 Membrane Fusion peptides are relatively hydrophobic and generally show an asymmetric 

distribution of hydrophobitiy when modeled into an alpha helix. They are also generally rich in 
alanine and glycine. 

At least three hydrophobic Membrane Fusion peptide regions have been identified within 
coronaviruses (PEPl, PEP2, and PEP3). See, Luo et al, 'TRoles in Cell-Cell Fusion of Two 
25 Conserved Hydrophobic Regions in the Murine Coronavirus Spike Protein", Virology (1998) 

244:483-494. Figure 1 of this paper shows an aUgnment of Membrane Fusion peptide sequences 
of Mouse Hepatitis Viris, Bovine Corona Virus, Feline Infectious Peritonitis Virus, 
Transmissible Gasti-oenteritis Virus and Infectious Bronchitis Virus. See also, Bosch et cd., "The 
Coronavirus Spike Protein is a Class I Virus Fusion Protein: Stiiactural and Functional 
30 Characterization of the Fusion Core Complex" Joumal of Virology (2003) 77(16):8801-8811. 

PEPl (SEQ ID NO: 7197), PEP2 (SEQ ID NO: 7198) and PEP3 (SEQ ID NO: 7199) 
sequences within the SARS spike protein have been identified. 

The coronavirus spike proteins (and other similar surface viral proteins) are thought to 
undergo a conformational change upon receptor binding to the target cell membrane. One or 
35 more of tiie hydrophobic Membrane Fusion peptides are tiiought to become exposed and inserted 
into the target membrane as a result of this conformational change. The free energy released 
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upon subsequent refolding of the spike protein to its most stable conformation is believed to play 

a role in the merger of the viral and cellular membranes. 

One or more SARS HR sequences, preferably HR2, or a fragment thereof may be used to 

inhibit viral entry and membrane fusion with a target mammalian host cell. The invention 
5 provides a method of inhibiting viral infection comprising administering a composition 

comprising one or more SARS HR polypeptides or a fragment thereof. Preferably, the 

composition comprises a SARS HR2 sequence. 

In another embodiment, the invention includes a composition comprising a SARS HRl 

sequence, or a fragment thereof and a SARS HR2 sequence, or a fragment thereof. The HRl and 
10 HR2 sequences may optionally be associated together in an oligomer. The composition may 

comprise the intermediate domain sequence between the HRl and HR2 domains. The use of 

such an intermediate sequence may facilitate oligomerization or other structural interaction 

between the HR regions. 

HR sequences for use in the invention may be produced recombinantly by methods known 
15 in the art. The SARS HR sequences may be modified to facilitate bacterial expression. In 

particular, the HR sequences may be modified to facilitate transport of the recombinant protein 

to the surface of the bacterial host cell. For example, leader sequences to a bacterial membrane 

protein may be added to the N terminus of the recombinant HR sequences. HR sequences for use 

in the invention may alternatively be produced by chemical synthesis by methods known in the 
20 art (see below). 

As discussed in more detail later in the specification. Applicants have identified stmctural 
similarities between the SARS spike protein and the surface protein of Neisseria meningitidis, 
NadA (and other similar bacterial adhesion proteins). Another means of facilitating bacterial 
expression of HR sequences includes the addition of the stalk and/or anchor sequences of a 
25 NadA-like protein to the C-terminus of the recombinant HR sequences. Recombinant sequences 
containing the bacterial anchor sequence may preferably be prepared in outer membrane vesicles 
(the preparation of which is discussed in more detail later in the application). Recombinant 
sequences missing the bacterial anchor sequences may be secreted and isolated from the 
supernatant. 

30 The invention includes a polypeptide sequence comprising a first sequence and a second 

sequence, wherein said first sequence comprises a leader sequence for a bacterial membrane 
protein and wherein said second sequence comprises a HR sequence of a coronavirus. 
Preferably, said first sequence comprises the leader sequence for a bacterial adhesin protein. 
More preferably, said bacterial adhesion protein is NadA. Preferably said second sequence 

35 comprises HRl, HR2 or both. In one embodiment, the second sequence comprises HRl, HR2 
and the intermediate domain sequence present in the naturally occrding spike protein. For 
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example, the second sequence niay comprise a fragment of a coronaviras spike protein 
comprising the amino acids starting with the N-tenmnus of the HRl region and ending with the 
C-terminus of the HR2 region. 

The invention further includes a polypeptide sequence comprising a first, second, third and 
fourth sequence, wherein the first sequence comprises a leader sequence for a bacterial 
membrane protein; wherein said second sequence comprises a HR sequence of a coronavirus; 
wherein said third sequence comprises a stalk domain of a bacterial adhesion protein; and 
wherein said fourth sequence comprises an anchor domain of a bacterial adhesion protein! In 
one embodiment, the first sequence comprising the leader peptide sequence is removed. In 
another embodiment, the third sequence comprising the stalk domain is removed. In another 
embodiment, the fourth sequence comprising the anchor domain is removed. 

The polypeptide sequences of the above described constructs may be linked together by 
means known in the art, including, for example, via glycine Unkers. 

' Examples of constructs which may be used.in such bacterial expression systems are shown 
in FIGURE 50. Polypeptide sequences of each of the constructs illustrated in FIGURE 50 are 
given as SEQ ID NOS: 7200 to 7206. 

7200 Leader NadA (1-29) - HRl (879-980) - 6Xgly - HR2 (1144-1 195) - stalk+anchor NadA (88-405) 

7201 Leader NadA (1-29) - HRl (879-980) - 6Xg]y - HR2 (1 144-1196) - stalk NadA (88-351) 

7202 Leader NadA (1-29) - HRl - HR2 (879-1 196) - stalk+anchor NadA (88-405) 

7203 Leader NadA (1-29) - HRl - HR2 (879-1 196)-stalk NadA (88-351) 

7204 HRl : HR2 (879-1 196)-stalk NadA (88-35 l)-6xhis 

7i205 Leader NadA (1-29) - HRl - HR2 (879-1 196)-anchor NadA (351-405) 
7206 Leader NadA (1-29) - HRl - HR2 (879- 1 196) 

Administration of one of more of these Membrane Fusion sequences may also interfere 
with the ability of a coronavirus to fuse to a host ceD membrane. Accordingly, the invfention 
includes an isolated polypeptide comprising an amino acid sequence selected ftom the group 
consisting of SEQ ID NO: 7197. SEQ ID NO: 7198and SEQ ID NO: 7199. The invention 
fijrther includes an isolated polypeptide comprising an amino acid sequence having sequence 
homology to an amino acid sequence selected from the group consisting of SEQ ID NO: 7197, 
SEQ ID NO: 7198and SEQ ID NO: 7199. 

Two or more of these SARS Membrane Fusion peptides can be combined together. The 
invention includes a composition comprising two SARS Membrane Fusion peptides wherein said 
peptides are selected from at least two of the amino acids selected from the group consisting of 
SEQ ID NO: 7197. SEQ ID NO: 7198and SEQ ID NO: 7199, or a sequence having sequence 
identity thereto. 

Two or more of the SARS Membrane Fusion peptides may be linked together. 
Accordingly, the invention includes a polypeptide comprising a first amino acid sequence and a 
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second amino acid sequence, wherein said first and second amino acid sequences are selected 
fix>m the group consisting of SEQ ID NO: 7197, SEQ ID NO: 7198and SEQ ID NO: 7199, or a 
sequence having sequence identity thereto. Preferably, said first amino acid sequence and said 
second amino acid sequence are different SARS Membrane Fusion peptides, i.e., they are not the 
5 same. 

The invention also includes a method of treating or preventing SARS virus infection 
comprising administering one or more of the SARS Membrane Fusion peptide compositions 
described above. 

As discussed above, the spike protein is capable of forming trimers. The invention further 
10 includes a polypeptide comprising SEQ ID NO: 6042 in trimeric form. The invention includes a 
composition comprising at least polypeptides wherein each polypeptide comprises at least the 
alpha-helical coiled region of a SARS virus spike protein. Preferably, the spike protein 
comprises SEQ ID NO: 6042 or a fi-agment thereof. 

The invention further includes a composition comprising a SARS virus spike protein or a ■ 
15 fragment thereof wherein said protein is associated with a transmembrane and wherein said 
fragment comprises the alpha-helical region of the SARS virus spike protein. Preferably, the 
composition comprises at least three SARS virus spike proteins or a fragment thereof, wherein 
the fragment comprises the alpha-helical region of the SARS virus spike protein. 

The invention further includes an antibody which specifically binds to a trimeric form of 
20 SARS virus spike proteins. Preferably, the spike protein comprises SEQ ID NO: 6042 or a 

fragment thereof. The invention includes an antibody which specifically binds to a trimeric form ; 
of SARS virus spike proteins wherein said proteins are associated with a transmembrane. 

The invention further includes an antibody which specifically binds to a monomeric form 
of SARS virus spike protein or a fragment thereof. Preferably, the antibody specifically binds to 
25 a monomeric form of SEQ ID NO: 6042 or a firagment thereof. 

The invention further includes a small molecule which interferes with or disrupts the 
coiling of a SARS viral spike protein trimer. 

The invention further includes an attenuated SARS virus for use as a vaccine wherein said 
attenuated virus contains a polynucleotide insertion, deletion or substitution which does not 
30 disrupt the trimeric conformation of the SARS virus spike protein. The invention fiirther 

includes an attenuated SARS virus for use as a vaccine wherein said attenuated virus contains a 
polynucleotide insertion, deletion or substitution which does not disrupt the alpha-helical 
formation of the SARS virus spike protein. 

The spike protein may be recombinantly produced. In one embodiment, the spike protein 
J5 is expressed in virus like particles so that the protein is attached to a cell membrane. Such 

attachment may facilitate presentation of immunogenic epitopes of the spike protein. Preferably, 
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the alpha-helical portion of the spike protein is associated with the cell membrane. Preferably, 
the spike proteins form a trimer within the transmembrane region of attachment 
Predicted N-glycosylation sites of SEQ ID NO: 6042 are identified below: 



Position 

29 NYTQ 
65 NVTG 
109 NKSQ 
119 NSTN 
158 NCTF 
227 NITN 
269 NGTl 
318 NITN 
330 NATK 
357 NSTF 
589 NASS 
602 NCTD 
699 NFSI 
783 NFSQ 
1080 NGTS 
1116 NNTV 
1176 NESL 



SEQ ID NO: 7207 
SEQ ID NO: 7208 
SEQ ID NO: 7209 
SEQ ID NO: 7210 
SEQ ID NO: 
SEQ ID NO 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO 

SEQ ID NO. 

SEQ ID. NO: 7217 
SEQ ID NO: 7218 
SEQ ID NO: 7219 
SEQ ID NO: 7220 
SEQ ID NO: 7221 
SEQ ID NO: 7222 
SEQ ID NO: 7223 



Potential Jury NGlyc 
agreement result 



7211 
7212 
7213 
7214 
7215 
7216 



0.7751 
0.8090 
0.6081 
0.7039 
0.5808 
0.7518 
0.6910 
0.6414 
0.6063 
0.5746 
0.5778 
0.6882 
0.5357 
0.6348 
0.5806 
0.5106 
0.6796 



(9/9) 
(9/9) 
(7/9) 
(9/9) 
(7/9) 
(9/9) 
(9/9) 
(9/9) 
(8/9) 
(8/9) 
(6/9) 
(9/9) 
(7/9) 
(9/9) 
(7/9) 
(5/9) 
(9/9) 



Accordingly, the invention includes a polypeptide comprising a fragment of SEQ ID NO: 
6042 wherein said fragment comprises one or more of the glycosylation sites identified above 
(SEQ ID NOS: 7207-7223). The invention further includes a polynucleotide encoding one or 
more of the fragments identified above. This glycosylation site can be covalentiy attached to a 
saccharide. Accordingly, the invention includes a polypeptide comprising a fragment of SEQ ID 
NO: 6042 wherein said firagment comprises one or more of the glycosylation sites identified . 
above and wherein said polypeptide is glycosylated at one or more of the sites identified above. 

Predicted O-glycosylation sites are identified below: 

Residue No. Potential Threshold Asslcrr,Tn»r,h 
Thr 698 
Thr 706 
Thr 922 
Ser 36 
Ser 703 

The invention includes a polypeptide comprising a fragment of SEQ ID NO: 6042 wherein 
said fi-agment comprises one or more of the O-glycosylation sites identified above. The 
invention further includes a polynucleotide encoding one or more of the fragments identified 
above. The invention fiirther includes a polypeptide comprising a firagment of SEQ ID NO: 
6042 wherein said firagment comprises one or more of the O-glycosylation sites identified above 
and further wherein the polypeptide is covalentiy bonded to a saccharide at one or more of the 
included glycosylation sites. 

The invention further includes a polypeptide comprising a fragment of SEQ ID NO: 6042 
wherein said fragment comprises one or more of the N-glycosylation sites identified above and 
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further wherein said fragment comprises one or more of the O-glycosylation sites identified 
above. 

The invention includes a polypeptide comprising a fragment of SEQ ID NO: 6042 wherein 
said fragment does not include one or more of the glycosylation sites identified above. The 
5 invention also includes a polynucleotide encoding such a polypeptide. 

Predicted phosphorylation sites of SEQ ID NO: 6042 are Ser-346, Tyr-195, and Tyr-723. 
Accordingly, the invention comprises a polypeptide comprising a fragment of SEQ ID NO: 6042 
wherein said fragment comprises at least ten amino acid residues and wherein said fragment 
comprises one or more of the amino acids selected from the group consisting of Ser-346, Tyr- 
10 195, and Tyr-723. In one embodiment, one or more of the amino acids selected from the group 
consisting of Ser-346, Tyr-195, and Tyr-723 are phosphorylated. 

Expression and functional characterization of the spike glycoprotein has been described by 
Xiao et al. (2003) Biochem Biophys Res Comm. 312:1159-1164. 

T-epitopes for SEQ ID NO: 6042 are identified in Table 16. The invention includes a 
15 polypeptide for use as an antigen, wherein the polypeptide comprises: (a) an amino acid 

sequence selected froni the group consisting of the T-epitope sequences identified as SEQ ID 
NOS: 8041-8280; (b) an amino acid sequence having sequence identity to an amino acid 
sequence of (a). The invention further comprising a polynucleotide sequence encoding the 
polypeptides of (a) or (b). The invention further comprising a method of expression or delivery 
20 of such polynucleotides through viral vectors and/or viral particles. The invention further 

comprises a polypeptide comprising two or more of the T-epitope sequences identified in SEQ 
ID NOS: 8041-8280, or a polynucleotide encoding such a polypeptide. 

The use as an antigen is preferably a use: (1) as a T-cell antigen; (2) for generating a 
complex between a class I MHC protein {e.g. a class I HLA) and a fragment of said antigen; (3) 
25 as an antigen for raising a cell-mediated immune response; and/or (4) as an antigen for raising a 
CTL response. The use preferably protects or treats disease and/or infection caused by a SARS 
virus. The invention provides the use of a polypeptide in the manufacture of a medicament for 
immunising a mammal (typically a human) against SARS viral infection wherein the polypeptide 
is as defined above. 

30 The invention provides a method of raising an immune response in a mammal (typically a 

human), comprising the step of administering to the mammal a polypeptide as defined above, 
wherein said immune response is a cell-mediated immune response and, preferably, a CTL 
response. The immune response is preferably protective or therapeutic. 

The invention includes a polypeptide comprising SEQ ID NO: 6040. The invention 

35 includes a polypeptide comprising an amino acid sequence having sequence identity to SEQ ID 
NO: 6040. The invention includes a fragment of a polypeptide comprising SEQ ID NO: 6040. 
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The invention includes a polynucleotide encoding SEQ ID NO: 6040 or a fragment thereof. The 
invention includes a diagnostic kit comprising a polypeptide comprising SEQ ID NO: 6040 or a 
fragment thereof. The invention includes a diagnostic kit comprising a polynucleotide sequence 
encoding SEQ ID NO: 6040 or a fragment thereof. The invention includes an antibody which 
recognizes a polypeptide comprising SEQ ID NO: 6040 or a fragment thereof. 

SEQ ID NO: 6040 demonstrates functional homology with a membrane protein of 
coronaviruses. Predicted transmembrane helices of SEQ ID NO: 6040 are identified below: 

Predicted Transmembrane Helices 

The sequence positions in brackets denominate the core region. 
Only scores above 500 are considered significant. 

Inside to outside helices . : 3 found 
from to score center 

27 ( 30) 48 ( 45) 1138 38 
137 ( 139) 153 ( 153) 486 146 

Outside to inside helices : 3 found 
from to score center 

28 {■ 31) 45 ( 45) 819 38 
71 { 73) 90 ( 90) 210 81 

136 ( 142) 156 ( 156) 272 149 

The amino acid region with the highest predicted transmembrane helical region is from 
amino acid position 27 to 48 of SEQ ID NO: 6040. Such transmembrane regions are often 
difficult to express recombinantly. Accordingly, the invention includes a polypeptide 
comprising a fragment of SEQ ID NO: 6040 wherein said fragment does not include the amino 
acid sequence between positions 27 to 48. The invention includes a polypeptide comprising a 
fragment of SEQ ID NO: 6040 wherein said fragment does not include the amino acid sequence 
between positions 28 to 45. The invention also includes a polynucleotide sequence encoding any 
of the above-identified polypeptides. 

SEQ ID NO: 6040 is predicted to be a hypothetical protein of the SARS virus. A 
prediction of the protein localization of SEQ ID NO: 6040 is set forth below. SEQ ID NO: 6040 
is predicted to be located in one of the following locations: mitochondrial matrix space, 
microbody (peroxisome), nucleus, and mitochondrial inner membrane. SEQ ID NO: 6040 is 
predicted to be associated with an organelle inside an infected cell. 

Accordingly, SEQ ID NO; 6040 is a target for screening of chemical inhibitors to the 
SARS virus. The invention includes a polypeptide comprises SEQ ID NO: 6040 or a fragment 
thereof. The invention includes a polynucleotide encoding the polypeptide sequence of SEQ ID 
NO: 6040 or a fragment thereof. The invention includes a method of screening SEQ ID NO: 
6040 for an inhibitor. The invention includes the recombinant expression of SEQ ID NO: 6040 
in a host cell. The invention includes a small molecule which prevents the polypeptide of SEQ 
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ID NO: 6040 fix)m associating with an organelle inside of an infected cell. The invention 

includes a fusion protein wherein said fusion protein comprises SEQ ID NO: 6040. 

PSORT Prediction of Protein Ziocalization sites 

version 6.4 (WWW) 

SEQ ID NO: 6040 163 Residues 

Species classification: 4 

*** Reasoning Step: 1 

Preliminary Calculation of ALOM < threshold : 0.5) 
coiant : 0 

McG: Examining signal sequence (McGeoch) 

Length of UR: 9 

Peak Value of UR: 1.75 

Net Charge of CR: 1 

Discriminant Score: -2.56 
GvH: Examining signal sequence (von Hei^ne) 

Signal Score (-3.5): 1.94 

Possible cleavage site: 53 
>» Seems to have no N-terminal signal seq. 
Amino Acid Composition of Predicted Mature Form: 

calculated from 1 
ALOM new cnt: 0 ** thrshld changed to -2 
Cleavable signal was detected in ALOM? : OB 
ALOM: finding transmembrane regions (Klein et al.) 

count: 0 value: 1.32 threshold: -2.0 

PERIPHERAL Likelihood = 1.32 , 

modified ALOM score: -1.16 ' 
Gavel: Examining the boundaary of mitochondrial targeting seq. 
motif at: 156 

HRSVTI 

Discrimination of mitochondrial target seq. : 

notclr ( 0.88) 
Rule: mitochondrial protein 
Rule: mitochondrial protein 
Rule: mitochondrial protein 
Rule: mitochondrial protein 

*** Reasoning Step: 2 

KDEL Count: 0 

Checking apolar signal for intramitochondrial sorting 
(Gavel position 156) from: 27 to: 44 Score: 5.0 
Mitochondrial matrix? Score: 0.36 
SKL motif (signal for peroxisomal protein) : 

pos: 99(163), count: 1 SKL 

SKL score (peroxisome) : 0.3 
Amino Acid Composition Tendency for Peroxisome: -4.28 
Peroxisomal proteins? Status: notclr 
Amino Acid Composition tendency for lysosomal proteins 

score: 0.02 Status: notclr 
Modified score for lysosome : 0 . 152 
Checking the amount of Basic Residues (nucleus) 
Checking the 4 residue pattern for Nuclear Targeting 

FOXind: pos: 132 (5) KRKR 
Checking the 7 residue pattern for Nuclear Targeting 
Checking the Robbins & Dingwall consensus (nucleus) 
Checking the RNA binding motif (nucleus or cytoplasm) 
nuc modified. Score: 0.60 
Nuclear Signal Status: notclr ( 0.30) 
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Checking CaaX motif . . 
Checking N-niyristoylation. • 
Checking CaaX motif . . 

Final Results 

mitochondrial matrix space Certainty= 0.480 (Affirmative) < siico 

microbody (peroxisome) Certainty^ 0 . 300 (Affirmative) < suco 

nucleus Certainty^ 0.300 (Affirmative) < suco 

mitochondrial inner membrane Certainty= 0.188 (Affirmative) < suco 

Predicted N-glycosylation sites of SEQ ID NO: 6040 are identified below. 
Position Potential Jury NGlyc 

agreement result 

2 NKTG (SEQ ID NO: 7255) 0.7804 (9/9) +++ 

106 NLTL (SEQ ID NO: 7256) 0.6123 (7/9) + 

Accordingly, the invention comprises a fragment of SEQ ID NO: 6040 wherein said 
fragment is at least ten amino acids and wherein isaid fragment comprises one or more of the 
asparagines from the amino acid positions of SEQ ID NO: 6040 selected from the group 
consisting of 2 and 106. The invention includes a fragment of SEQ ID NO: 6040 wherein said 
fragment comprises one or more amino acid sequences selected from the group consisting of 
SEQ ID NO: 7255 and SEQ ID NO: 7256. Preferably, the fragment comprises the amino acid 
sequence NKTG (SEQ ID NO: 7255). 

The invention includes a polypeptide comprising a fragment of SEQ ID NO: 6040 wherein 
said fragment does not include one or more of the glycosylation sites identified above, the 
invention also includes a polynucleotide encoding such a polypeptide. 

T-epitopes for SEQ ID NO: 6040 are identified in Table 14. The invention includes a 
polypeptide for use as an antigen, wherein the polypeptide comprises: (a) an amino acid 
sequence selected from the group consisting of the T-epitope sequences identified in SEQ ID 
NOS: 7640-7800; (b) an amino acid sequence having sequence identity to ah amino acid 
sequence of (a). The invention further comprising a polynucleotide sequence encoding the 
polypeptides of (a) or (b). The invention further comprising a method of expression or delivery 
of such polynucleotides through viral vectors and/or viral particles. The invention further 
comprises a polypeptide comprising two or more of the T-epitope sequences identified in SEQ 
ID NOS: 7640-7800, or a polynucleotide encoding such a polypeptide. 

The use as an antigen is preferably a use: (1) as a T-cell antigen; (2) for generating a 
complex between a class I MHC protein {e.g. a class I HLA) and a fragment of said antigen; (3) 
as an antigen for raising a cell-mediated immune response; and/or (4) as an antigen for raising a 
CTL response. The use preferably protects or treats disease and/or infection caused by a S ARS 
virus. 
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The invention provides the use of a polypeptide in the manufactuie of a medicament for 
immunising a mammal (typically a human) against SARS viral infection wherein the polypeptide 
is as defined above. 

The invention provides a method of raising an immune response in a mammal (typically a 
5 human), comprising the step of administering to the mammal a polypeptide as defined above, 
wherein said immune response is a cell-mediated immune response and, preferably, a CTL 
response. The immune response is preferably protective or therapeutic. 

The invention includes a polypeptide comprising SEQ ID NO: 6041. SEQ ID NO: 6041 
demonstrates functional homology with a portion of an ORF lab polyprotein. The invention 

10 includes a polypeptide comprising an amino acid sequence having sequence identity to SEQ ID 
NO: 6041. The invention includes a fragment of a polypeptide comprising SEQ ID NO: 6041. 
The invention includes a polynucleotide sequence encoding an amino acid sequence having 
sequence identity to SEQ ID NO: 6041. The invention includes a polynucleotide encoding a 
fragment of a polypeptide comprising SEQ ID NO: 6041. 

15 The invention includes a diagnostic kit comprising a polypeptide comprising SEQ ID NO: 

6041 or a fragment therof. The invention includes a diagnostic kit comprising a polynucleotide 
encoding SEQ ID NO: 6041 or a fragment thereof. The invention includes an inununogenic 
composition comprising a polypeptide comprising SEQ ID NO: 6041 or a fragment thereof. The 
invention includes an antibody which recognizes a polypeptide comprising SEQ ID NO: 6041 or 

20 a fragment thereof. 

The polyproteins of coronaviruses are associated with enzymatic activity. Accordingly, 
SEQ ID NO: 6041 is a target for screening of chemical inhibitors to the SARS virus. The 
invention includes a polypeptide comprising SEQ ID NO: 6041 or a fragment thereof. The 
invention includes a polynucleotide encoding the polypeptide sequence of SEQ ID NO: 6041 or 

25 a fragment thereof. The invention includes a method of screening SEQ ID NO: 6041 for an 

inhibitor. The invention includes the recombinant expression of SEQ ID NO: 6041 in a host cell. , 
The invention includes a snniall molecule which prevents the polypeptide of SEQ ID NO: 6041 
from performing enzymative activity. The invention includes a fusion protein wherein said 
fusion protein comprises SEQ ID NO: 6041. 

30 Predicted transmembrane or hydrophobic regions of SEQ ID NO: 6041 are identified 

below. Although the polyprotein of coronaviruses is proteolytically cleaved into numerous 
smaller proteins, hydrophobic domains in the polyprotein are known to mediate the membrane 
association of the replication complex and to be able to dramatically alter the architecture of host 
cell membranes. Accordingly, the hydrophobic domains of the polyprotein are targets for 

35 genetic mutation to develop attenuated SARS virus vaccines. The hydrophobic domains are also 
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targets for small molecule inhibitors of the SARS virus. The hydrophobic domains may also be 
used to generate antibodies specific to those regions to treat or prevent SARS virus infection. 

Possible tr-ansmembrane helices of SEQ ID NO: 6041 

The sequence positions in brackets denominate the core region. 
Only scores above 500 are considered significant. 



Inside to outside helices : 18 found 





from 


to 


score 


center 


234 


( 234) 254 


( 250) 


1046 


241 


256 


( 256) 272 


( 270) 


252 


263 


319 


( 319) 334 


( 334) 


227 


327 


503 


( 505) 522 


( 519) 


405 


512 


. 613 


{ 615) 633 


( 629) 


619 


622 


677 


{ 679) 703 


( 696) 


467 


689 


849 


{ 851) 869 


( 865) 


229 


858 


1080 


(1080) 1097 


(1094) 


306 


1087 


1147 


(1149)1163 


(1163) 


354 


1156 


1557 


(1557)1581 


(1577) 


817 


1567 


1954 


(1954)1971 


(1971) 


832 


1964 


2369 


(2372)2395 


(2387) 


300 


2379 


2513 


(2513)2532 


(2529) 


690 


2522 


Outside to inside helices : 14 found 




from 


to 


score 


center 


239 


( 239) 254 


( 254) 


924 


247 


239 


( 248)' 272 


( 263) 


468 


256 


311 


( 314) 334 


( 328) 


267 


321 


499 


( 503) 522 


( 519) 


485 


512 


617 


( 617) 634 


( 631) 


425 


624 


849 


( 853) 872 


( 872) 


572 


864 


1147 


(1147)1162 


(1162) 


765 


1155 


1564 


(1564) 1581 


(1579) 


883 


1572 


1951 


(1951)1968 


(1966) 


657 


1958 


2513 


(2522)2539 


(2537) 


711 


2529 



Accordingly, the invention includes a polypeptide comprising a fragment of SEQ ID NO: 
6041, wherein said fragment comprises an amino acid sequence including one or more of the 
hydrophobic transmembrane sequences identified above. The invention includes a polypeptide 
comprising a fragment of SEQ ID NO: 6041 wherein said fragment comprises one or more of the 
foUowing jpolypeptide sequences of SEQ ID NO: 6041: 234-254, 613-633, 1557-1581, 1954- 
1971, 2513-2532, 239-254, 1564-1581, 1951-1968, 2513-2539. Preferably, the fragment 
comprises one or more of the following polypeptide sequences of SEQ ID NO: 6041 : 234-254 
and 239-254. The invention also includes polynucleotides encoding each of the polypeptide 
fragments identified above. 

The invention includes an attenuated SARS virus wherein said attenuated SARS virus 
contains an addition, deletion or substitution in the polynucleotides encoding for one of the 
hydrophobic domains identified above. The invention also includes a method for creating an 
attenuated SARS virus comprising mutating a SARS virus by adding, deleting or substituting the 
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viral genome of the SARS virus to alter the coding of one or more of the hydrophobic domains 
of SEQ ID NO: 6041 identified above. 

The invention includes an antibody which specifically identifies one or more of the 
hydrophobic regions of SEQ ID NO: 6041 identified above. The invention includes a small 
5 molecule which binds to, interferes with the hydrophobicity of or otherwise disrupts one or more 
of the hydrophobic regions of SEQ ID NO: 6041 identified above. 

Predicted N-glycosylation sites of SEQ ID NO: 6041 are identified below: 

Position Potential Jury NGlyc 

agreement result 



571 


NIjSH 


(SEQ 


ID 


NO: 


7257) 


0, 


.6598 


(8/9) 


+ 


835 


NTSR 


(SEQ 


ID 


NO: 


7258) 


0. 


.5762 


(7/9) 


+ 


958 


NVTD 


(SEQ 


ID 


NO: 


7259) 


0. 


.7494 


(9/9) 




1113 


NISD 


(SEQ 


ID 


NO: 


7260) 


0. 


.7259 


(8/9) 


+ 


1205 


NSTL 


(SEQ 


ID 


NO: 


7261) 


0, 


.6296 


(9/9) 


++ 


1460 


NVTG 


(SEQ 


ID 


NO: 


7262) 


0, 


.6844 


(9/9) 




1685 


NHSV 


(SEQ 


ID 


,N0: 


7263) 


0, 


.5181 


(5/9) 


+ 


2029 


NKTT 


(SEQ 


ID 


NO: 


7264) 


0, 


.5423 


(5/9) 


+ 



Accordingly, the invention comprises a polypeptide comprising a fragment of the amino 

20 acid sequence of SEQ ID NO: 6041; wherein said fragment comprises one or more of the N- 
glycosylation sites identified above. The invention comprises a polypeptide comprising a 
fragment of SEQ ID NO: 6041 wherein said fragment comprises one or more of sequences SEQ 
ID NOS: 7257-7264. Preferably, the fragment comprises one or more of the sequences SEQ ID ■ 
NOS: 7257, 7259, 7260, 7261 and 7262. The invention further comprises a polynucleotide 

25 encoding one or more of the polypeptides identified above. 

The invention includes a polypeptide comprising a fragment of SEQ ID NO: 6041 wherein 
said fragment does not include one or more of the glycosylation sites identified above. The 
invention also includes a polynucleotide encoding such a polypeptide. 

T-epitopes for SEQ K) NO: 6041 are identified in Table 15. The invention includes a 

30 polypeptide for use as an antigen, wherein the polypeptide comprises: (a) an amino acid 

sequence selected from the group consisting of the T-epitope sequences identified in SEQ ID 
NOS: 7801-8040; (b) an amino acid sequence having sequence identity to an amino acid 
sequence of (a). The invention further comprising a polynucleotide sequence encoding the 
polypeptides of (a) or (b). The invention further comprising a method of expression or delivery 

35 of such polynucleotides through viral vectors and/or viral particles. The invention further 

comprises a polypeptide comprising two or more of the T-epitope sequences identified in SEQ 
ID NOS: 7801-8040, or a polynucleotide encoding such a polypeptide. 

The use as an antigen is preferably a use: (1) as a T-ceU antigen; (2) for generating a 
complex between a class I MHC protein (e.g. a class I HLA) and a fragment of said antigen; (3) 

40 as an antigen for raising a cell-mediated immune response; and/or (4) as an antigen for raising a 
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CTL response. The use preferably protects or treats disease and/or infection caused by a S ARS 
virus. 

The invention provides the use of a polypeptide in the manufacture of a medicament for 
immunising a mammal (typically a human) against SARS viral infection wherein the polypeptide 
5 is as defined above. 

The invention provides a method of raising an immune response in a mammal (typically a 
human), comprising the step of administering to the mammal a polypeptide as defined above, 
wherein said immune response is a cell-mediated immune response and, preferably, a CTL 
response. The immune response is preferably protective or therapeutic. 
10 The invention includes a polypeptide sequence SEQ ID NO: 6043 or a fragment thereof. 

The invention includes a polypeptide comprising an amino acid sequence having sequence 
identity to SEQ ED NO: 6043. The invention includes a polynucleotide sequence encoding the 
amino acid sequence of SEQ ID NO: 6043 or a fragment thereof. 

Predicted transmembrane regions of SEQ ID NO: 6043 are set forth below. 

15 Inside to outside helices : 4 found 

from to score center 

41 ( 41) 56 ( 56) 1789 49 
76 ( 79) 99 ( 99) 2142 89 
105 { 105) 125 ( 125) 1250 115 

20 

Outside to inside helices : ■ 3 foxmd 

from to score center 

41 { 41) 59 ( 56) 2053 49 
76 ( 82) 98 ( 96) 1580 89 
25 103 ( 105) 125 ( 123) 1257 115 

The amino acid region with the highest predicted transmembrane helical region is from 
amino acid position 76 to 99 of SEQ ID NO: 6043. Such transmembrane regions are often 
difficult to express recombinantly. Accordingly, the invention includes a polypeptide 
30 comprising a fragment of SEQ ID NO: 6043 wherein said fragment does not include one or more 
of the hydrophobic amino acid sequences identified above. Preferably, the fragment does not 
include the amino acids between positions 27 to 48. The invention also includes a 
polynucleotide sequence encoding any of the above-identified polypeptides. 

SEQ ID NO: 6043 is predicted to be a hypothetical protein of the SARS virus. A 
35 prediction of the protein localization of SEQ ID NO: 6043 is set forth below. SEQ ID NO: 6043 
is predicted to be located in one of the following locations: mitochondrial inner membrane, 
plasma membrane, Golgi body, and mitochondrial intermembrane space. SEQ ID NO: 6043 
may be associated with an organelle inside an infected cell. 

Accordingly, SEQ ID NO: 6043 is a target for screening of chemical inhibitors to the 
40 SARS virus. The invention includes a polypeptide comprises SEQ ID NO: 6043 or a fragment 
thereof. The invention includes a polynucleotide encoding the polypeptide sequence of SEQ ID 
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NO: 6043 or a fragment thereof. The invention includes a method of screening SEQ ID NO: 
6043 for an inhibitor. The invention includes the recombinant expression of SEQ ID NO: 6043 
in a host cell. The invention includes a small molecule which prevents Ihe polypeptide of SEQ 
ID NO: 6043 from associating with an organelle inside of an infected cell. The invention 
includes a fusion protein wherein said fusion protein comprises SEQ ID NO: 6043. 

PSORT Prediction of Protein Iiocalization Sites for SEQ ID NO: 6043 

version 6. 4 (WWW) 

Species classification; 4 
*** Reasoning Step: 1 

Preliminary Calculation of ALOM {threshold: 0.5) 
count: 3 

Position of the most N-terminal TMS: 40 at i=2 
MTOP : membrane topology (Hartmann et al . ) 

I (middle): 47 Charge dif fir ence(C-N) : 3.5 
McG; Examining signal sequence (McGeoch) 

Length of UR: 12 

Peak Value of UR: 1.41 

Net Charge of CR: 0 

Discriminant Score:' -4.67 
GvH: Examining signal sequence (von Heijne) 

Signal Score (-3.5): 3.44 

Possible cleavage site: 15 
>» Seems to have no N-terminal signal seq. 
Amino Acid Composition of Predicted Mature Form: 

calculated from 1 
ALOM new cnt: 2 ** thrshld changed to -2 
Cleavable signal was detected in ALOM?: OB 
ALOM: finding transmeitibrane regions (Klein et al.) 

count: 2 value: -6.90 threshold: -2.0 

INTEGRAL Likelihood = -6.90 Transmembrane 83 ' - 99 ( 78 - 101)- 

INTEGRAL Likelihood = -5.04 Transmembrane 40 - 56 ( 37 - 60) 

PERIPHERAL Likelihood = -0.32 ^ 

modified ALOM score: 1.48 
»> Likely a Type Illb membrane protein (Nexo Ccyt) 
Gavel: Examining the boundary of mitochondrial targeting seq. 
motif at: 128 

MRCWLC 

Discrimination of mitochondrial target seq. : 

notcir (0.76) 
Rule: mitochondrial protein 
Rule: mitochondrial protein 
Rule: mitochondrial protein 
Rule: mitochondrial protein 

*** Reasoning Step: 2 

Type Illa or Illb is favored for ER memb. proteins 
KDEL Count : 0 

Checking apolar signal for intramitochondrial sorting 
(Gavel position 128) from: 39 to: 56 Score: 11.5 
»> Seems to have an intreumitochondrial signal 
Mitochondrial inner membreine? Score: 0.59 
Mitochondrial intermemb. space? Score: 0.22 
SKL motif (signal for peroxisomal protein) : 
pos: 92(274), count: 1 SHL 
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SKL score (peroxisome); 0.3 
Amino Acid Composition Tendency for Peroxisome: 4.78 
Peroxisomal proteins? Status: positive 
Amino Acid Composition tendency for lysosomal proteins 
5 score: 1.16 Status: notclr 

Type III proteins may be localized at Golgi 
Checking the amount of Basic Residues (nucleus) 
Checking the 4 residue pattern for Nuclear Targeting 
Checking the 7 residue pattern for Nuclear Targeting 
IV Checking the Robbins & Dingwall consensus (nucleus) 

Checking the RNA binding motif (nucleus or cytoplasm) 
Nuclear Signal Status: negative ( 0.00) 

Check the Number of TMSs for typelll (plasma memb.) " " 

Checking N-myristoylation. . 

Final Results 

mitochondrial inner membrane Certainty^ 0 . 664 (Affirmative) < suco 

Plasma membrane — Certainty= 0 . 600 (Affirmative) < suco 
Golgi body — Certainty= 0 . 400 (Affirmative) < suco 
ZU mxtochondrial intermembrane space Certainty^ 0.362 (Affirmative) < suco 

Predicted N- and O- glycosylation sites of SEQ ID NO: 6043 are identified below. 
Position Potential Jury NGlyc 

25 227 NATF (SEQ ID NO: 7265) 0.6328 ^^i^^jpt^-^^-lt 

Residue No. Potential Threshold Assignment 

Thr 28 0.9095 0.6280 T 

Thr 32 0.8740 0.6595 T 

iU Thr 34 0.9058 0.6655 T 

Thr 170 0.6816 0.6600 T 

Thr 267 0.9240 0.5779 t 

Thr 268 0.7313 0.5708 T 

Thr 269 0.9859 0.5583 T 

iD Thr 270 0.8023 0.5492 T 

Ser 27 0.6930 0.6091 S 

Ser 252 0.6457 0.5977 S 
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Accordingly, the invention comprises a polypeptide comprising a fragment of the amino 
acid sequence of SEQ ID NO: 6043. wherein said fragment comprises the N-glycosylation sites 
or O-glycosylation sites identified above. The invention comprises a polypeptide comprising a 
fragment of SEQ ID NO: 6043 wherein said fragment comprises one or more of the N- 
glycosylation sites or O-glycosylation sites identified above. The invention further comprises a 
polynucleotide encoding one or more of the polypeptides identified above. 

The invention includes a polypeptide comprising a fragment of SEQ ID NO: 6043 wherein 
said fragment does not include one or more of the glycosylation sites identified above. The 
invention also includes a polynucleotide encoding such a polypeptide. 

T-epitopes for SEQ ID NO: 6043 are identified in Table 17. The invention includes a 
polypeptide for use as an antigen, wherein the polypeptide comprises: (a) an amino acid 
sequence selected from the group consisting of the T-epitope sequences identified in SEQ ID 
NOS: 8281-8486; (b) an amino acid sequence having sequence identity to an amino acid 
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sequence of (a). The invention fiuther comprising a polynucleotide sequence encoding the 
polypeptides of (a) or (b). The invention further comprising a method of expression or delivery 
of such polynucleotides through viral vectors and/or viral particles. The invention further 
comprises a polypeptide comprising two or more of the T-epitope sequences identified in SEQ 
5 ID NOS: 8281-8486, or a polynucleotide encoding such a polypeptide. 

The use as an antigen is preferably a use: (1) as a T-cell antigen; (2) for generating a 
complex between a class I MHC protein (e.g. a class I HLA) and a fragment of said antigen; (3) 
as an antigen for raising a cell-mediated immune response; and/or (4) as an antigen for rising a 
CTL response. The use preferably protects or treats disease and/or infection caused by a SARS 
10 virus. The invention provides the use of a polypeptide in the manufacture of a medicament for 
immunising a manmial (typically a human) against SARS viral infection wherein the polypeptide 
is as defined above. 

The invention provides a method of raising an immune response in a mammal (typically a 
human), comprising the step of administering to the mammal a polypeptide as defined above, 
15 wherein said immune response is a cell-mediated immune response and, preferably, a CTL 
response. The inomune response is preferably protective or therapeutic. 

The invention includes a polypeptide comprising SEQ ID NO: 6044. The invention 
includes a polypeptide comprising a fragment of SEQ ID NO: 6044 or a sequence having 
sequence identity to SEQ ID NO:206. The invention includes a polynucleotide encoding SEQ 
20 ID NO: 6044. 

SEQ ID NO: 6044 is identified as a hypothetical protein. Predicted ^hydrophobic or 

transmembrane regions of SEQ ID NO: 6044 are identified below: 

Inside to outside helices : 3 found 
from to score center 

25 1 ( 1) 17 < 15) 891 8 

47 ( 47) 66 ( 63) 221 56 

Outside to inside helices : 4 found 
from to score center 

30 1 { 4) 21 < 19) 599 11 

Accordingly, the invention includes a polypeptide comprising a fragment of SEQ ID NO: 
6044 wherein said fragment does not include one or more of the hydrophobic amino acid 
sequences identified above. Preferably, the fragment does not include the amino acids between 
35 positions 1 to 19. The invention also includes a polynucleotide sequence encoding any of the 
above-identified polypeptides. 

SEQ ID NO: 6044 is predicted to be a hypothetical protein of SARS vims. A prediction of 
the protein localization of SEQ ID NO: 6044 is set forth below. SEQ ID NO: 6044 is predicted 
to be located in one of the following locations: nucleus, mitochondrial matrix, lysosome (lumen). 
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and microbody (peroxisome). SEQ ID NO: 6044 may be associated with an organelle inside an 
infected cell. 

Accordingly, SEQ ID NO: 6044 is a target for screening of chemical inhibitors to the 
SARS virus. The invention includes a polypeptide comprises SEQ ID NO: 6044 or a fragment 
thereof. The invention includes a polynucleotide encoding the polypeptide sequence of SEQ ID 
NO: 6044 or a fragment thereof. The invention includes a method of screening SEQ ID NO: 
6044 for an inhibitor. The invention includes the recombinant expression of SEQ ID NO: 6044 
in a host cell. The invention includes a small molecule which prevents the polypeptide of SEQ 
ID NO: 6044 from associating with an organelle inside of an infected cell. The invention 
includes a fusion protein wherein said fiision protein comprises SEQ ID NO: 6044. 

PSORT Prediction of Protein Ziocalization Sites for SEQ ID NO: 6044 

version 6.4 (WWW) 

154 Residues 

Species classification: 4 
*** Reasoning Step: 1 



Preliminary Calculation of ALOM (threshold: 0.5) 
count : 0 

20 McG: Examining signal sequence (McGeoch) 

Length of UR: 7 
Peak Value of UR: 1.06 
Net Charge of CR: 1 , 
Discriminant Score: -7.97 
25 GvH: Examining signal sequence (von Heijne) 

Signal Score (-3.5): -3.28 
Possible cleavage site: 34 
»> Seems to have no N- terminal signal seq. 
Amino Acid Composition of Predicted Mature Form: 
30 calculated from 1 

ALOM new cnt: 0 ** thrshld changed to -2 
Cleavable signal was detected in ALOM?: OB 
ALOM: finding transmembrane regions (Klein et al.) 
count: 0 value: 1.43 threshold: -2.0 
35 PERIPHERAL Likelihood = 1.43 

modified ALOM score: -1.19 
Gavel: Examining the boundary of mitochondrial targeting seq. 
motif at: 151 
FRKKQV 

40 Discrimination of mitochondrial target seq. : 

notclr (-0.46) 

*** Reasoning Step: 2 

45 KDEL Covmt: 0 

Checking apolar signal for intramitochondrial sorting 
(Gavel position 151) from: 46 to: 50 Score: 5.0 

Mitochondrial matrix? Score: 0.36 

SKL motif (signal for peroxisomal protein) : 
50 pos: -1(154), count: 0 

Amino Acid Composition Tendency for Peroxisome: 0.61 

Peroxisomal proteins? Status: notclr 
AAC score (peroxisome) : 0.149 

Amino Acid Con^osition tendency for lysosomal proteins 
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score: 0.81 Status: notclr 
Modified score for lysosome: 0.231 
Checking the amount of Basic Residues (nucleus). 
Checking the 4 residue pattern for Nuclear Targeting 
5 Fovmd: pes: 134 (3) KHKK 

Checking the 7 residue pattern for Nuclear Targeting 
Checking the Robbins & Dingwall consensus (nucleus) 

Fovmd: pos : 13 6 (3) KK VSTNLCTHSF RKKQV 
Final Robbins Score (nucleus): 0.60 
10 Checking the RNA binding motif (nucleus or cytoplasm) 

nuc modified. Score: 0.90 

Nuclear Signal Status: positive ( 0.70) . 
Checking CaaX motif . . 
Checking N-myristoylation. . 
15 Checking CaaX motif. . 

Final Results 

nucleus - — Cerfcaintya 0 . 880 (Affirmative) < suco 

mitocliondrial matrix space Certainty= 0.360 (A££ixmative) < suco 

20 lysosome (lumen) Certainty^ 0 .231(Af f izmatxve) < suco 

mierobody (peroxisome) Certainty^ 0.149 (Affirmative) < suco 

One predicted O-glycosylation site of SEQ ID NO: 6044 is identified at residue 4: 

Residue No. Potential Threshold Assignment 
25 Thr 4 0.6839 0 ! 6484 T 

Accordingly, the invention comprises a polypeptide comprising a fragment of the amino 
acid sequence of SEQ ID NO: 6044, wherein said fragment comprises the O-glycosylation site 
identified above. The invention further comprises a polynucleotide encoding one or more of the 

30 polypeptides identified above. 

The invention includes a polypeptide comprising a fragment of SEQ ID NO: 6044 wherein 
said fragment does not include one or more of the glycosylation sites identified above; The 
invention also includes a polynucleotide encoding such a polypeptide. 

T-epitopes for SEQ ID NO: 6044 are identified in Table 18. The invention includes a 

35 polypeptide for use as an antigen, wherein the polypeptide comprises: (a) an amino acid 

sequence selected from the group consisting of the T-epitope sequences identified in SEQ ID 
NOS: 8487-8665; (b) an amino acid sequence having sequence identity to an amino acid 
sequence of (a). The invention further comprising a polynucleotide sequence encoding the 
polypeptides of (a) or (b). The invention further comprising a method of expression or delivery 

40 of such polynucleotides through viral vectors and/or viral particles. The invention further 

comprises a polypeptide comprising two or more of the T-epitope sequences identified in SEQ 
ID NOS: 8487-8665, or a polynucleotide encoding such a polypeptide. 

The use as an antigen is preferably a use: (1) as a T-cell antigen; (2) for generating a 
complex between a class I MHC protein (e.g. a class I HLA) and a fragment of said antigen; (3) 

45 as an antigen for raising a cell-mediated immune response; and/or (4) as an antigen for raising a 
CTL response. The use preferably protects or treats disease and/or infection caused by a S ARS 
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viras. The invention provides the use of a polypeptide in the manufacture of a medicament for 
immunising a mammal (typically a human) against S ARS viral infection wherein the polypeptide 
is as defined above. 

The invention provides a method of raising an immune response in a mammal (typically a 
human), comprising the step of administering to the mammal a polypeptide as defined above, 
wherein said immune response is a cell-mediated immune response and, preferably, a CTL 
response. The immune response is preferably protective or therapeutic. 

The invention includes a polypeptide sequence comprising SEQ ID NO: 6045. The 
invention includes a polypeptide sequence comprising an amino acid sequence having sequence 
identity to SEQ ID NO: 6045. The invention includes a polypeptide sequence comprising a 
fragment of SEQ ID NO: 6045. The invention includes a polynucleotide sequence encoding any 
of these polypeptides. 

SEQ ID NO: 6045 demonstrates fiinctional homology with the envelope or small 
membrane protein of coronavimses. The invention includes a diagnostic kit comprising a 
polypeptide comprising SEQ ID NO: 6045 or a fragment thereof. The invention includes a 
diagnostic kit comprising a polynucleotide encoding SEQ ID NO: 6045 or a fragment thereof. 
The invention includes an immunogenic composition comprising SEQ ID NO: 6045 or a 
fragment thereof. The invention includes an antibody which recognizes a polypeptide : 
comprising SEQ ID NO: 6045 or a fragment thereof. 

Predicted transmembrane regions of SEQ ID NO: 6045 are identified below: 
Inside to outside helices : 1 found 
from to score center 

17 ( 19) 33 < 33) -2881 26 . . 

Outside to inside helices : 1 found 
from to score center 

17 ( 17) 34 ( 34) 2981 27 

Accordingly, the invention includes a polypeptide comprising a fragment of SEQ ID NO: 
6045 wherein said fragment does not include one or more of the hydrophobic amino acid 
sequences identified above. Preferably, the fragment does not include the amino acids between 
positions 17 to 34. The invention also, includes a polynucleotide sequence encoding any of the 
above-identified polypeptides. In one embodiment, the invention includes a polypeptide 
comprising a fragment of SEQ ID NO: 6045 wherein said fragment does not include amino acid 
residues 1-34 of SEQ ID NO: 6045. 

Predicted protein Localization Site of SEQ ID NO: 6045 is below. 

PSORT Prediction of Protein lK>calization Sites for SEQ ID NO: 6045 

version 6.4 (WWW) 

Species classification: 4 
*** Reasoning Step: 1 
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Preliminary Calculation of ALOM (threshold: 0.5) 
count: 2 

Position of the most N-terminal TMS: 17 at i=l 
5 MTOP: membrane topology (Hartmann et al.) 

I (middle): 24 Charge diffirence (C-N> : 2.0 
McG: Examining signal sequence (McGeoch) 
Length of UR: 29 
Peak Value of UR: 3.40 
10 Net Charge of CR: -2 

Discriminant Score: 13.07 
GvH: Examining signal sequence (von Heijne) 
Signal Score (-3.5): 4.37 
Possible cleavage site: 32 
15 ... positive value of mtop ... 

»> Seems to have an uncleavable N-terro signal seq. 
Amino Acid Composition of Predicted Mature Form: 

calculated from 1 
ALOM new cnt: 1 ** thrshld changed to -2 
20 Cleavable signal was detected in ALOM? : OB 

ALOM: finding transmembrane regions (Klein et al.) 
count: 1 value: -15.12 threshold: -2.0 

INTEGRAL Likelihood =-15.12 Transmembrane 17 - 33 ( 8 - 44) 
PERIPHERAL Likelihood = 0.47 
25 modified ALOM score : 3 . 12 

»> Seems to be a Type lb (Nexo Ccyt) membrane protein 

The cytoplasmic tail is from 34 to 76 (44 Residues) 
Rule: vesicular pathway 
Rule: vesicular pathway 
JO Rule: vesicular pathway 

( 6) or uncleavable? 

Gavel: Examining the boundary of mitochondrial targeting seq. 
motif at: 6 
Uncleavable? Ipos set to: 16 
J5 Discrimination of mitochondrial target seq. : 

notclr ( 0.19) 
Rule: vesicular pathway 
Rule: vesicular pathway 
Rule: vesicular pathway 

*** Reasoning Step: 2 

> Relative position of the end of the tail: 44% 

Memb. protein with uncleavable signl is often at ER 
I'S KDEL Count: 0 

Checking apolar signal for intramitochondrial sorting 
(Gavel position 16) from: 70 to: 99 Score: 21.5 

»> Seems to have an intramitochondrial signal 

SKL motif (signal for peroxisomal protein): 
'0 pos: -1(76), count: 0 

Amino Acid Composition Tendency for Peroxisome: -4.11 

Peroxisomal proteins? Status: negative 

Amino Acid Composition tendency for lysosomal proteins 
score: 0.68 Status: notclr 
■5 Checking the amotmt of Basic Residues (nucleus) 

Checking the 4 residue pattern for Nuclear Targeting 

Checking the 7 residue pattern for Nuclear Targeting 

Checking the Robbins & Dingwall consensus (nucleus) 

Checking the RNA binding motif (nucleus or cytoplasm) 
0 Nuclear Signal Status: negative (0.00) 

Check cytoplasmic tail for typelb (plasma memb.) 
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Checking the NPXY motif . . 
Checking the YXRF motif . . 
Checking N-nvristoylation. . 

Final Results 

plasma membrane Certainty=. 0 .730 (Affirmative) < suco 

endoplasmic reticulum (membrane) Certainty^ 0.640 (Affirmative) < suco 

endoplasmic reticulum (lumen) — Certainty^ 0.100(Af f irmative) < suco 
outside Certainty^ 0.100 (Affirmative) < suco 

Predicted N-glycosylation sites of SEQ ID NO: 6045 are identified at residues 48 and 66: 
Position Potential Jury NGlyc 
agreement result 
48 NVSI. 0.6514 (9/9) ++ (SEQ ID NO: 7266) 

66 NSSE 0.5880 (7/9) + (SEQ ID NO: 7267) 

Accordingly, the invention comprises a polypeptide comprising a fragment of the amino 
acid sequence of SEQ ID NO: 6045, wherein said fragment comprises one or more of the 
N-glycosylation sites identified above. The invention comprises a polypeptide comprising a 
fragment of SEQ ID NO: 6045 wherein said fragment comprises one or more of the N- 
glycosylation sites identified above. The invention further comprises a polynucleotide encoding 
one or more of the polypeptides identified above. 

The invention includes a polypeptide comprising a fragment of SEQ ID NO: 6045 wherein 
said fragment does not include one or more of the glycosylation sites identified above. The 
invention also includes a polynucleotide encoding such a polypeptide. 

T-epitopes for SEQ ID NO: 6045 are identified in Table 19. The invention includes a 
polypeptide for use as an antigen, wherein the polypeptide comprises: (a) an amino acid 
sequence selected fix)m the group consisting of the T-epitope sequences identified in SEQ ID 
NOS: 8666-8820; (b) an amino acid sequence having sequence identity to an amino acid 
sequence of (a). The invention farther comprising a polynucleotide sequence encoding the 
polypeptides of (a) or (b). The invention further comprising a method of expression or delivery 
of such polynucleotides tiurough viral vectors and/or viral particles. The invention further 
comprises a polypeptide comprising two or more of the T-epitope sequences identified in SEQ 
ID NOS: 8666-8820, or a polynucleotide encoding such a polypeptide. 

The use as an antigen is preferably a use: (1) as a T-cell antigen; (2) for generating a 
complex between a class I MHC protein (,e,g. a class I HLA) and a fragment of said antigen; (3) 
as an antigen for raising a cell-mediated inmiune response; and/or (4) as an antigen for raising a 
CTL response. The use preferably protects or treats disease and/or infection caused by a S ARS 
virus. The invention provides the use of a polypeptide in the manufacture of a medicament for 
immunising a mammal (typically a human) against SARS viral infection wherein the polypeptide 
is as defined above. 
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The invention provides a method of raising an immune response in a mammal (typically a 
human), comprising the step of administering to the mammal a polypeptide as defmed above, 
wherein said immune response is a cell-mediated immune response and, preferably, a CTL 
response. The immune response is preferably protective or therapeutic. 

The invention includes a polypeptide sequence comprising SEQ ID NO: 6046. The 
invention includes polypeptide sequences comprising an amino acid sequence having sequence 
identity to SEQ ID NO: 6046. The invention includes a polypeptide sequence comprising a 
fragment of SEQ ID NO: 6046. The invention includes a polynucleotide encoding one of these 
polypeptides. 

SEQ ID NO: 6046 has functional homology with a matrix protein of a coronavirus. The 
invention includes a diagnostic kit comprising a polypeptide comprising SEQ ID NO: 6046 or a 
fragment thereof. The invention includes a diagnostic kit comprising a polynucleotide encoding • 
SEQ ID NO: 6046 or a fragment thereof. The invention includes an immunogenic composition 
comprising SEQ ID NO: 6046 or a fragment thereof The invention includes an antibody which 
recognizes a polypeptide comprising SEQ ID NO: 6046 or a fragment thereof. 

Predicted transmembrane regions of SEQ ID NO: 6046 are identified below. 
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Accordingly, the invention includes a polypeptide comprising a fragment of SEQ ID NO: 
6046 wherein said fragment does not include one or more of the hydrophobic amino acid 
sequences identified above. Preferably, the fragment does not include the amino acids between 
positions selected from the group consisting of 18 to 38, 52 to 67 and 76 to 95. The invention 
also includes a polynucleotide sequence encoding any of the above-identified polypeptides. 

Predicted protein localization of SEQ ID NO: 6046 is set forth below. 

FSORT Prediction o£ Protein Xrf>calizablon Sites 

version 6. 4 (WWW) 

Species classification: 4 
*** Reasoning Step: 1 

Preliminary Calculation of ALOM (threshold: 0.5) 
count : 3 

Position of the most N-terminal TMS: 21 at i=l 
MTOP: membrane topology (Hartmann et al.) 

I (middle): 28 Charge diffirence (C-N) : 6.0 
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McG: Examining signal sequence (McGeoch) 
Length of UR: 1 
Peak Value of UR: 3.16 . 
Net Charge of CR: -3 
5 Discriminant Score: 2.21 

GvH: Examining signal sequence (von Heijne) 
Signal Score (-3.5): 4.29 
Possible cleavage site: 39 

positive value of mtop ... 

10 »> Seems to have an uncleavcJDle N-term signal seq. 

Amino Acid Composition of Predicted Mature Form: 

calculated from 1 _ . , 

Cleavable signal was detected in ALOM? : OB 
ALOM: finding transmembrane regions (Klein et al.) 
15 count: 3 value: -7.64 threshold: 0.5 

INTEGRAL Likelihood = -7.64 Transmembrane 21 - 37 ( 18 - 39) 
INTEGRAL Likelihood = -7.59 Transmembrane 50 - 66 ( 43 - 72) 
INTEGRAL Likelihood = -5.04 Transmembrane 79 - 95 ( 72 - 99) 
PERIPHERAL Likelihood =2.38 
20 modified ALOM score: 2.13 . 

»> Likely a Type Illb membrane protein (Nexo Ccyt) 
Rule: vesicular pathway 
Rule: vesicular pathway 
Rule: vesicular pathway 
25 ( 2) or uncleavable? 

Gavel: Examining the boundary of mitochondrial targeting seq. 
motif at: 2 
Uncleavable? Ipos set to: 12 
Discrimination of mitochondrial target seq. : 
30 negative (-4.16) 

Rule: vesicular pathway 
Rule: vesicular pathway 
Rule: vesicular pathway 

.35 *** Reasoning Step: 2 

Type Illa or Illb is favored for ER memb. proteins 
Memb. protein with uncleavable signl is often at ER 
KDEL Co\mt: 0 

40 Checking apolar signal for intramitochondrial sorting 

SKL motif (signal for peroxisomal protein) : 
pos: -1(221), count: 0 

Amino Acid Composition Tendency for Peroxisome: 5.01 

Peroxisomal proteins? Status: notclr 
45 Amino Acid Composition tendency for lysosomal proteins 

score: 2.30 Status: positive 

Type III proteins may be localized at Golgi 

Checking the amount of Basic Residues (nucleus) 

Checking the 4 residue pattern for Nuclear Targeting 
50 Checking the 7 residue pattern for Nuclear Targeting 

Checking the Robbins & Dingwall consensus (nucleus) 

Checking the RNA binding motif (nucleus or cytoplasm) 

Nuclear Signal Status: negative ( 0.00) 

Check the Nvonber of TMSs for typelll (plasma memb.) 
55 Checking N-myristoylation. . 

Final Results 

endoplasmic reticulum (membrane) Certaiaty= 0.685 (Affirmative) < sucO 

plasma membrane Certaxntys 0 . 640 (Affirmative) < sueo 

60 Golgi body Certainty= 0.460 (Affirmative) < suco 

endoplasmic reticulum (Ixmien) Certainty^ 0 .100 (Af f izaiatlve) < suco 
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One predicted N-glycosylation sites of SEQ ID NO: 6046 is identified at residue 4: 

Prediction ofN-elvcosvI ation sites 

Position Potential Jury NGlyc 
5 agreement result 

4 NGTI 0.8430 (9/9) +++ (SEQ ID NO: 7268) 

Accordingly, the invention comprises a polypeptide comprising a fragment of the amino 
acid sequence of SEQ ID NO: 6046, wherein said fragment comprises the N-glycosylation site 

10 identified above. The invention further comprises a polynucleotide encoding one or more of the 
polyp^tides identified above. 

The invention further comprises a polypeptide comprising a fragment of amino acid 
sequence SEQ ID NO: 6046, wherein said fragment does not include the N-glycosylation site 
identified above. The invention includes a polynucleotide encoding such a fragment. 

15 A variant of SEQ ID NO: 6046 that is included within the invention is SEQ ID NO: 9963. 

Compared to SEQ ID NO: 6046, this sequence has Val at residue 72 instead of Ala. 

T-epitopes for SEQ ID NO: 6046 are identified in Table 20. The invention includes a 
polypeptide for use as an antigen, wherein the polypeptide comprises: (a) an amino acid 
sequence selected from the group consisting of the T-epitope sequences identified in SEQ ID 

20 NOS: 8821-9018; Qb) an amino acid sequence having sequence identity to an amino acid 
sequence of (a). The invention further comprising a polynucleotide sequence encoding the 
polypeptides of (a) or (b). The invention further comprising a method of expression or delivery 
of such polynucleotides through viral vectors and/or viral particles. The invention further 
comprises a polypeptide comprising two or more of the T-epitope sequences identified in SEQ 

25 ID NOS: 8821-9018, or a polynucleotide encoding such a polypeptide. 

The use as an antigen is preferably a use: (1) as a T-cell antigen; (2) for generating a 
complex between a class I MHC protein (e.g. a class I HLA) and a fragment of said antigen; (3) 
as an antigen for raising a cell-mediated immune response; and/or (4) as an antigen for raising a 
CTL response. The use preferably protects or treats disease and/or infection caused by a S ARS 

30 virus. The invention provides the use of a polypeptide in the manufacture of a medicament for 
immunising a mammal (typically a human) against S ARS viral infection wherein the polypeptide 
is as defined above. 

The invention provides a method of raising an immune response in a mammal (typically a 
human), comprising the step of administering to the mammal a polypeptide as defined above, 
35 wherein said immune response is a cell-mediated immune response and, preferably, a CTL 
response. The immune response is preferably protective or therapeutic. 
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The invention includes a polypeptide sequence comprising SEQ ID NO 
fragment thereof or an amino acid sequence having sequence identity thereto, 
transmembrane regions of SEQ ID NO: 6047 are identified below. 
Inside to outside helices : 2 found 
from to score center 

7 ( 10) 29 ( 27) 729 17 

21 ( 24) 41 ( 41) 640 34 

Outside to inside helices : 2 found 
from to score center 

4 ( 4) 22 ( 19) 874 12 

22 ( 24) 41 ( 41) 499 31 

Accordingly, the invention includes a polypeptide comprising a fragment of SEQ ID NO: 
6047 wherein said fragment does not include one or more of the hydrophobic amino acid 
sequences identified above. Preferably, the fragment does not include the amino acids between 
positions selected from the group consisting of 4 to 22 and 22 to 41. The invention also includes 
a polynucleotide sequence encoding any of the above-identified polypeptides. 

SEQ ID NO: 6047 is predicted to be a hypothetical protein of SARS virus. A prediction of 
the protein localization of SEQ ID NO: 6047 is set forth below. SEQ ID NO: 6047 is predicted 
to be located in one of the following locations: plasma membrane, endoplasmic reticulum, Golgi 
body, and microbody (peroxisome). SEQ ID NO: 6047 may be associated with an organeUe 
inside an infected cell or with viral entry to a host cell. 

Accordingly, SEQ ID NO: 6047 is a target for screening of chemical inhibitors to the 
SARS virus. The invention includes a polypeptide comprises SEQ ID NO: 6047 or a fragment 
thereof. The invention includes a polynucleotide encoding the polypeptide sequence of SEQ ID 
NO: 6047 or a fragment thereof. The invention includes a method of screening SEQ ID NO: 
6047 for an inhibitor. The invention includes the recombinant expression of SEQ ID NO: 6047 
in a host cell. The invention includes a small molecule which prevents the polypeptide of SEQ 
ID NO: 6047 from associating with an organelle inside of an infected cell or interacting with a 
host cell membrane. The invention includes a fusion protein wherein said fusion protein 
comprises SEQ ID NO: 6047. Predicted protein localization of SEQ ID NO: 6047 is set forth 
below. 

PSORT Prediction of Protein Localization Sites 

version 6.4 (WWW) 

Species classification: 4 
*** Reasoning Step: 1 

Preliminary Calculation of ALOM (threshold: 0.5) 
co\mt : 1 

Position of the most N-terminal TMS: 2 at i=l 
MTOP: membrane topology (Hartmann et al.) 

I (middle): 9 Charge diffirence (C-N) : 0.5 
McG: Examining signal sequence (McGeoch) 
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Length of URt 6 

Peak Value of UR: 3.08 

Met Chairge o£ CR: 0 

Discriminant Score: 5.12 
GvH: Examining signal sequence (von Heijne) 

Signal Score (-3.5): -4.45 

Possible cleavage site: 34. 
»> Seems to have an imcleavable N-term signal seq, 
Amino Acid Composition of Predicted Mature Form: 

calculated from 1 
ALOM new cnt: 1 ** thrshld changed to -2 
Cleavable signal was detected in ALOM? : OB 
ALOM: finding transmembrane regions (Klein et al.) 

count: 1 value: -2.44 threshold: -2.0 

INTEGRAL Likelihood = -2.44 Transmembrane 2 - 18 ( 1 - 

PERIPHERAL Likelihood = 1.22 

modified ALOM score: 0.59 
»> Seems to be a Type II (Ncyt Cexo) membrane protein 

The cytoplasmic tail is from 1 to 1 (1 Residues) 
Rule: vesicular pathway 
Rule: vesicular pathway 
Riule : vesicular pathway 
{ 5) or \mcleavable? 

Gavel: Examining the bovmdary of mitochondrial targeting seq 
motif at: 5 

Uncleavable? Ipos set to: 15 
Discrimination of mitochondrial target seq. : 

notclr ( 1.48) 
Rule: vesicular pathway 
Rule: vesicular pathway 
Rule: vesicular pathway 

*** Reasoning Step: 2 

Relative position of the cytoplasmic tail: 1% 

Larger value (>30%) is favared for ER memb. proteins 
Memb. protein with uncleavable signl is often at ER 

KDEL Cotint: 0 

Checking apolar signal for intramitochondrial sorting 

(Gavel position 15) from: 64 to: 93 Score: 30.0 
>» Seems to have an intramitochondrial signal 
SKL motif (signal for peroxisomal protein) : 

pos : -1(63), count : 0 
Amino Acid Composition Tendency for Peroxisome: 1.91 
Peroxisomal proteins? Status: notclr 

AAC score (peroxisome): 0.161 
Amino Acid Composition tendency for lysosomal proteins 

score: 0.04 Status: notclr 
Checking the consensus for Golgi 
Checking the consensus for Golgi 

Checking the cytoplasmic tail of type II. (Golgi) 
Checking the amo\mt of Basic Residues (nucleus) 
Checking the 4 residue pattern for Nuclear Targeting 
Checking the 7 residue pattern for Nuclear Targeting 
Checking the Robbins & Dingwall consensus (nucleus) 
Checking the RNA binding motif (nucleus or cytoplasm) 
Nuclear Signal Status: negative ( 0.00) 
Check mitochondrial signal for typell (plasma memb. ) 
Type II is favored for plasma memb. proteins 
Checking the NPXY motif.. 
Checking the YXRF motif -. . 
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Checking N-myristoylation. . 
Final Results 

plasma membrane Certainty= 0.685(A££ixiiiat:xve) < suco 

5 endoplasmic reticulum (membrane) Certainty^ 0.640 (Affirmative) < suco 

Golgi body Certainty= 0.370 (Affirmative) < suco 

microbody (peroxisome) Certainty- 0.161(Af f irmative) < suco 

T-epitopes for SEQ ID NO: 6047 are identified in Table 21. The invention includes a 
10 polypeptide for use as an antigen, wherein the polypeptide comprises: (a) an amino acid 

sequence selected from the group consisting of the T-epitope sequences identified in SEQ ID 
NOS: 9019-9131; (b) an amino acid sequence having sequence identity to an amino acid 
sequence of (a). The invention further comprising a polynucleotide sequence encoding the 
polypeptides of (a) or (b). The invention further comprising a method of expression or deUvery 
15 of such polynucleotides through viral vectors and/or viral particles. The invention further 

comprises a polypeptide comprising two or more of the T-epitope sequences identified in SEQ 
ID NOS: 9019-9131, or a polynucleotide encoding such a polypeptide. 

The use as an antigen is preferably a use: (1) as a T-cell antigen; (2) for generating a 
complex between a class I MHC protein (e.g. a class I HLA) and a ft-agment of said antigen; (3) 
20 as an antigen for raising a cell-mediated immune response; and/or (4) as an antigen for raising a ■ 
CTL response. The use preferably protects or treats disease and/or infection caused by a SARS 
virus. The invention provides the use of a polypeptide in the manufacture of a medicament for 
immunising a mammal (typically a human) against SARS viral infection wherein the polypeptide 
is as defined above. 

25 The invention provides a method of raising an immune response in a mammal (typically a 

human), comprising the step of administering to the mammal a polypeptide as defined above, 
wherein said inmiune response is a cell-mediated immune response and, preferably, a CTL 
response. The immune response is preferably protective or therapeutic. 

The invention includes a polypeptide comprising SEQ ID NO: 6048, a fragment thereof or 

30 an amino acid sequence having sequence identity thereto. Predicted transmembrane regions of 
SEQ ID NO: 6048 are identified below. 



Inside to outside 


helices : 


2 found 


from 


to score 


center 


3 { 3) 18 ( 


18) 1857 


10 


100 ( 100) 117 ( 


115) 2904 


107 


Outside to inside 


helices : 


2 found 


from 


to score 


center 


1 ( 1) 15 ( 


15) 1299 


8 


100 { 100) 117 ( 


115) 3009 


107 



Accordingly, the invention includes a polypeptide comprising a fragment of SEQ ID NO: 
6048 wherein said fragment does not include one or more of the hydrophobic amino acid 
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sequences identified above. Preferably, the fragment does not include the amino acids between 
positions selected from the group consisting of 1 to 15 and 100 to 117. The invention also 
includes a polynucleotide sequence encoding any of the above-identified polypeptides. 

SEQ ID NO: 6048 is predicted to be a hypothetical protein of SARS vims. A prediction of 
the protein localization of SEQ ID NO: 6048 is set forth below. SEQ ID NO: 6048 is predicted 
to be located in one of the following locations: plasma membrane, lysosome (membrane), 
microbody (peroxisome), and endoplasmic reticulum (membrane). SEQ ID NO: 6048 may be 
associated with an organelle inside an infected ceU or may interact with a host ceU plasma 
membrane during viral entry to the host cell. 

Accordmgly. SEQ ID NO: 6048 is a target for screening of chemical inhibitors to the 
SARS virus. The invention includes a polypeptide comprises SEQ ID NO: 6048 or a fragment 
thereof. The invention includes a polynucleotide encoding the polypeptide sequence of SEQ ID 
NO: 6048 or a fragment tiiereof. The invention includes a metiiod of screening SEQ ID NO: 
6048 for an inhibitor. The invention includes the recombinant expression of SEQ ID NO: 6048 
in a host cell. The invention includes a small molecule which prevents the polypeptide of SEQ 
ID NO: 6048 from associating with an organelle inside of an infected ceU or prevents the 
polypeptide from associating with the cell membrane of a host cell. The invention includes a 
fusion protein wherein said fusion protein comprises SEQ ID NO: 6048. Predicted protein 
localization of SEQ ID NO: 6048 is set forth below. 

PSORT — Prediction of Protein localization Sites 

version 6 . 4 (WWW) 

Species classification: 4 
*** Reasoning Step: 1 

Preliminary Calculation of ALOM (threshold: 0.5) 
count : 2 

Position of the most N-texminal TMS: 3 at i=2 
MTOP: membrane topology (Hartmann et al.) 

I (middle): 10 Charge diffirence(C-N) : -2.5 
McG: Examining signal secauence (McGeoch) 

Length of UR: 13 

Peak Value of UR: 3.38 

Net Charge of CR: 1 

Discriminant Score; 10.02 
GvH: Examining signal sequence (von Heijne) 

Signal Score (-3.5): 2.56 

Possible cleavage site: 15 
»> Seems to have a cleavable N-term signal seq. 
Amino Acid Composition of Predicted Mature Form: 

calculated from 16 
ALOM new cnt: 2 ** thrshld changed to -2 
Cleavable signal was detected in ALOM?: IB 
ALOM: finding transmembrane regions (Klein et al.) 

count: 1 value: -14.75 threshold: -2.0 ' 

INTEGRAL Likelihood =-14.75 Transmembrane 101 - 117 ( 95 - 120) 
PERIPHERAL Likelihood = 6.63 t »s 120) 

modified ALOM score: 3.05 
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>>> Seems to be a Type la membrane protein 

The cytoplasmic tail is from 118 to 122 (5 Residues) 
Rule: vesicular pathway 
Rule: vesicular pathway 
Rule: vesicular pathway 
(15) or uncleavable? 

Gavel: Examining the boundary of mitochondrial targeting sea 

motif at: 15 ^' . 

Uncleavable? Ipos set to: 25 
Discrimination of mitochondrial target seg. : 

notclr (0.73) 
Rule: vesicular pathway 
Rule: vesicular pathway 
Rule: vesicular pathway 

*** Reasoning Step; 2 

KDEL Count : 0 

Checking apolar signal for intramitochondrial sorting 
l\} (Gavel position 25) from: 3 to: 12 Score: 8.5 

SKL motif (signal for peroxisomal protein) : 

pos: -1(122), coxint: 0 
Amino Acid Composition Tendency for Peroxisome: 2 46 
AAC not from the N-term. , score modified 
15 Peroxisomal proteins? Status: notclr 

AAC score (peroxisome): 0.115 
Amino Acid Con^josition tendency for lysosomal proteins 
scpre: -0.40 Status: negative 
. GY motif in the tail of typela? (lysosomal) 
50 Checking the amount of Basic Residues (nucleus) 

Checking the 4 residue pattern for Nuclear Targeting 
Checking the 7 residue pattern for NUclear Targeting 
Checking the Robbins t Dingwall consensus (nucleus) 
Checking the RNA binding motif (nucleus or cytoplasm) 
>0 Nuclear Signal Status: negative (0.00) 

Type la is favored for plasma memb. proteins 
Checking the NPXY motif . . 
Checking the YXRF motif.. 
Checking N-itvristoylation. . 
K) Checking GPI anchor . . 

>» Seems to be GPI-anchored (0,85) 

Pinal Results 

plasma membrane — Certainty= 0.919 (Affirmative) < suco 
^D lysosome (membrane) — Certainty^ 0.200 (Affirmative) < sueo 

microbody (peroxisome) — Certainty- 0 . 115 (Affirmative) < suco 
endoplasmic reticulum (membrane) Certainty- 0.100 (Affirmative) < suco 

T-^pitopes for SEQ ID NO: 6048 are identified in Table 22. The invention includes a 
:0 polypeptide for use as an antigen, wherein the polypeptide comprises: (a) an amino acid 

sequence selected from the group consisting of the T-epitope sequences identified in SEQ ID 
NOS: 9132-9308; (b) an amino acid sequence having sequence identity to an amino acid 
sequence of (a). The invention further comprising a polynucleotide sequence encoding the 
polypeptides of (a) or (b). The invention further comprising a method of expression or delivery 
5 of such polynucleotides through viral vectors and/or viral particles. The invention further 
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comprises a polypeptide comprising two or more of the T-epitope sequences identified in SEQ 
ID NOS: 9132-9308, or a polynucleotide encoding such a polypeptide. 

The use as an antigen is preferably a use: (1) as a T-cell antigen; (2) for generating a 
complex between a class I MHC protein (e.g. a class I HLA) and a fragment of said antigen; (3) 
5 as an antigen for raising a cell-mediated immune response; and/or (4) as an antigen for raising a 
CTL response. The use preferably protects or treats disease and/or infection caused by a S ARS 
virus. The invention provides the use of a polypeptide in the manufacture of a medicament for 
immunising a mammal (typically a human) against S ARS viral infection wherein the polypeptide 
is as defined above. 

10 The invention provides a method of raising an immune response in a mammal (typically a 

himian), comprising the step of administering to the mammal a polypeptide as defined above, 
wherein said immune response is a cell-mediated immune response and, preferably, a CTL 
response. The immune response is preferably protective or therapeutic. 

The invention includes a polypeptide comprising SEQ ID NO: 6049, a fragment thereof or 

15 an amino acid sequence having sequence identity thereto. Predicted transmembrane or 

hydrophobic regions of SEQ ID NO: 6049 are identified below. 

Inside to outside helices : 1 found 
from to score center 

13 ( 13) 30 { 28) 3532 20 

20 

Outside to inside helices : 1 found 
from to score center 

9 ( 11) 29 { 26) 3395 19 

25 Accordingly, the invention includes a polypeptide comprising a fragment of SEQ ID NO: 

6049 wherein said fragment does not include one or more of the hydrophobic amino acid 
sequences identified above. The invention also includes a polynucleotide sequence encoding any 
of the above-identified polypeptides. 

SEQ ID NO: 6049 is predicted to be a hypothetical protein of SARS virus. A prediction of 

30 the protein localization of SEQ ID NO: 6049 is set forth below. SEQ ID NO: 6049 is predicted 
to be located in one of the following locations: outside, microbody (peroxisome), endoplasmic 
reticulum (membrane) and endoplasmic reticulum Qumen), The highest ranking indicates that 
SEQ ID NO: 6049 is located on the outside of a cell. Accordingly, SEQ ID NO: 6049 may be a 
surface exposed protein. 

55 Accordingly, SEQ ID NO: 6049 may be used in an inmiunogenic composition to raise an 

immune response against the SARS vims. It also may be used to generate antibodies specific to 
the SARS virus. Such antibodies may be used in a method of treatment or prevention of a SARS 
virus infection. Such antibodies may further be used in a diagnostic test to identify the presence 
or absence of SARS virus in a biological sample. 
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The invention includes a polypeptide comprises SEQ ID NO: 6049 or a fragment thereof. 
The invention includes a polynucleotide encoding the polypeptide sequence of SEQ ID NO: 
6049 or a fragment thereof. The invention includes a method of screening SEQ ID NO: 6049 for 
an inhibitor. The invention includes the recombinant expression of SEQ ID NO: 6049 in a host 
cell. The invention includes a fusion protein wherein said fusion protein comprises SEQ ID NO: 
6049. Predicted protein localization of SEQ ID NO: 6049 is set forth below. 
PSORT — Prediction of Protein Localization Sites 

.species Classification: 4 • version 6 . 4 (WWW) ^ - 

*** Reasoning Step: 1 

Preliminary Calculation of ALOM (threshold- 0 5) 
count: 1 . 



Position of the most N- terminal TMS: 11 at i=l 
MTOP: membrane topology (Hartmann et a J . ) 

I (middle): 18 Charge dif f irence (C-N) : -2 0 
McG: Examining signal sequence (McGeoch) 
Length of UR: 24 
20 Peak Value of UR: 3.69 

Net Charge of CR: -2 
Discriminant Score: 13.56 
GvH: Examining signal sequence (von Heijne) 
Signal Score (-3.5): 0.52 
25 Possible cleavage site: 25 

»> Seems to have a cleavable N-term signal seq 
Amxno Acid Composition of Predicted Mature Form- 

calculated from 26 
ALOM new cnt: 1 ** thrshld changed to -2 
JU Cleavable signal was detected in ALOM?: IB 

ALOM: finding transmembrane regions (Klein et al ) 
count: 0 value: 14.80 threshold: -2 0 
PERIPHERAL Likelihood =14.80 
modified ALOM score: -3.86 
>5 Rule: vesicular pathway 

Rule: vesicular pathway 
Rule: vesicular pathway 
( 2) or uncleavable? 
\0 ^^""^^ mS'^at"^2''^^ boundary of mitochondrial targeting seq. 

Uncleavable? Ipos set to: 12 
Discrimination of mitochondrial target seq • 

notclr (1.42) 
Rule: vesicular pathway 
•5 Rule: vesicular pathway 

Rule: vesicular pathway 

*** Reasoning Step: 2 

0 KDEL Count: 0 

Number of Potential N-glycosylation Sites: 0 
Out: score 0.800 

Checking apolar signal for intramitochondrial sorting 
(Gavel position 12) from: 44 to: 73 Score: 30 0 
^ »> Seems to have an intramitochondrial signal 

SKL motif (signal for peroxisomal protein) • 
pos: -1(44), count: 0 
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Aniino Acid Composition Tendency for Peroxisome: 9.47 

AAC not from the N-term. , score modified 
Peroxisomal proteins? Status: notclr 

AAC score (peroxisome): 0.320 
Amino Acid Composition tendency for lysosomal proteins 

score: -6,47 Status: negative 
Number of NX(S/T) motif: 0 

Checking the amount of Basic Residues (nucleus) 
Checking the 4 residue pattern for Nuclear Targeting 
Checking the 7 residue pattern for Nuclear Targeting 
Checking the Robbins & Dingwall consensus (nucleus) 
Checking the RNA binding motif (nucleus or cytoplasm) 
Nuclear Signal Status: negative ( 0.00) 
Checking CaaX motif . . 
Checking N-myristoylation . . 
Checking Caax motif.. 

Final Results 

outside Certainty= 0 . 820 (Affirmative) < suco 

microbody (peroxiscxme) Certainty= 0.320 (Affirmative) < suco 

endoplasmic reticulum (membrane) Certainty= 0 . 100 (Affirmative) < suco 

endoplasmic reticulum (lumen) Certainty^ 0 .100 (Affirmative) < suco 

T-epitopes for SEQ ID NO: 6049 are identified in Table 23. The invention includes a 
polypeptide for use as an antigen, wherein the polypeptide comprises: (a): an amino acid 
sequence selected fi-om the group consisting of the T-epitope sequences identified in SEQ ID 
NOS: 9309-9437; (b) an amino acid sequence having sequence identity to an amino acid 
sequence of (a). The invention further comprising a polynucleotide sequence encoding the 
polypeptides of (a) or (b). The invention further comprising a method of expression or delivery 
of such polynucleotides through viral vectors and/or viral particles. The invention further 
comprises a polypeptide comprising two or more of the T-epitope sequences identified in SEQ 
ID NOS: 9309-9437, or a polynucleotide encoding such a polypeptide. 

The use as an antigen is preferably a use: (1) as a T-cell antigen; (2) for generating a 
complex.between a class I MHC protein (e.g. a class I HLA) and a fragment of said antigen; (3) 
as an antigen for raising a cell-mediated inraiune response; and/or (4) as an antigen for raising a 
CTL response. The use preferably protects or treats disease and/or infection caused by a SARS 
virus. The invention provides the use of a polypeptide in tiie manufacture of a medicament for 
inununising a manmial (typically a human) against SARS viral infection wherein the polypeptide 
is as defined above. 

The invention provides a method of raising an inmiune response in a manunal (typically a 
human), comprising the step of administering to the manmial a polypeptide as defined above, 
wherein said inmiuhe response is a cell-mediated inomune response and, preferably, a CTL 
response. The inunune response is preferably protective or therapeutic. 
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The invention includes a polypeptide comprising SEQ ID NO: 6050 or a fragment thereof 
or an amino acid sequence having sequence identity thereto. Predicted transmembrane or 
hydrophobic regions are identified below. 

Inside to outside helices : 1 found 
5 from to score center- 

IB { 15) 32 ( 30) 558 23 

Outside to inside helices : 1 found 
from to score center 

10 16 ( 16) 30 ( 30) 364 23 

Accordingly, the invention includes a polypeptide comprising a fragment of SEQ ID NO: 
6050 wherein said fragment does not include one or more of the hydrophobic amino acid 
sequences identified above. The invention also includes a polynucleotide sequence encoding any 

15 of the above-identified polypeptides. 

SEQ ID NO: 6050 is predicted to be a hypothetical protein of SARS virus. A prediction of 
the protein localization of SEQ ID NO: 6050 is set forth below. SEQ ID NO: 6050 is predicted 
to be located in one of the foUowing locations: lysosome (lumen), mitochondrial matrix space, 
mitochondrial inner membrane, and mitochondrial intennembrane space. SEQ ID NO: 6050 

20 may be associated witii an organelle inside an infected cell during the viral rephcation cycle. 

Accordingly, SEQ ID NO: 6050 is a target for screening of chemical inhibitors to the 
SARS virus. The invention includes a polypeptide comprises SEQ ID NO: 6050 or a fragment 
thereof. The invention includes a polynucleotide encoding the polypeptide sequence of SEQ ID '= 
NO: 6050 or a fragment tiiereof. The invention includes a method of screening SEQ ID NO: 

25 6050 for an inhibitor. The invention includes the recombinant expression of SEQ ID NO: 6050 
m a host ceU. The invention includes a small molecule which prevents the polypeptide of SEQ 
ID NO: 6050 from associating with an organeUe inside of an infected cell or prevents the 
polypeptide from associating with tiie cell membrane of a host cell. The invention includes a 
fusion protein wherein said fusion protein comprises SEQ ID NO: 6050. Predicted protein 

50 localization of SEQ ID NO: 6050 is set forth below. 

PSORT Prediction of Protein Localization Sites 

MV<3Pn o/i « version 6.4<WWW) 

'"SEQ 84 Residues 

^ Species classification: 4 

*** Reasoning Step: 1 

Preliminary Calculation of ALOM {threshold: 0 5) 
count : 0 

O McG: Examining signal sequence (McGeoch) 

Length of UR: 3 

Peak Value of UR: 1.46 

Net Charge of CR: 2 

Discriminant Score: -5,73 
5 GvH: Examining signal sequence (von Heijne) 
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Signal Score (-3.5): -0.12 
Possible cleavage site: 29 
>» Seems to have no N-terminal signal seg. 
Amino Acid Composition of Predicted Mature Fom: 
5 calculated from 1 

ALOM new cnt: 0 ** thrshld changed to -2 
Cleavable signal was detected in ALOM? : OB 
ALOM: finding transmeihbrane regions (Klein et al.) 
count: 0 value: 8.43 threshold: -2.0 
10 PERIPHERAL Likelihood =8.43 

modified ALOM score: -2.59 
Gavel: Examining the boundary of mitochondrial targeting seq. _ .- 

motif at: 61 
ARCWYL 

15 Discrimination of mitochondrial target seg. : 

positive ( 1.66) 

Rule : mitochondrial protein 

Rule : mitochondrial protein 

Rule: mitochondrial protein 
20 Rule: mitochondrial protein 

*** Reasoning Step: 2 

KDEL Count: 0 

25 "Checking apolar signal for intramitochondrial sorting 

(Gavel position 61) from: 52 to: 58 Score: 6.0 
Mitochondrial matrix? Score: 0.38 
• SKL motif (signal for peroxisomal protein): 
pos: -1(84), coxint: 0 
30 Amino Acid Composition Tendency for Peroxisome: 1.47 

Peroxisomal proteins? Status: notclr 

AAC score (peroxisome): 0.263 
Amino Acid Composition tendency for lysosomal proteins 
score: 2.86 Status: positive 
35 Modified score for lysosome: 0.850 

Checking the amount of Basic Residues (nucleus) 
Checking the 4 residue pattern for Nuclear Targeting 
Checking, the 7 residue pattern for Nuclear Targeting 
Checking the Robbins & Dingwall consensus (nucleus) 
40 Checking the RNA binding motif (nucleus or cytoplasm) 

Nuclear Signal Status: negative ( 0.00) 
Checking "Caax motif.. 
Checking N-myristoylation . . 
Checking CaaX motif.. 

45 

Final Results 

lysoscane (lumen) Certaintys 0.850(A££ixmatlve) < suco 

mitocliondirlal nafez-ix space Certainty^ 0.544 (Affirmative) < suco 

nitoehondrial inner menibrane Certainty^ 0.266 (Affirmative) < suco 

50 mitochondrial intermembrane space Certainty^ 0.266 (Affirmative) < suco 

One predicted N-glycosylation sites of SEQ ID NO: 6050 is identified at residue 43: 

Position Potential Jury NGlyc 
agreement result 
55 43 NVTI 0.6713 (9/9) ++ (SEQ ID NO: 7269) 

Accordingly, the invention comprises a polypeptide comprising a fragment of the amino 
acid sequence of SEQ ID NO: 6050 wherein said fragment comprises the N-glycosylation site 
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identified above. The invention further comprises a polynucleotide encoding one or more of the 
polypeptides identified above. 

The invention further comprises a polypeptide comprising a fragment of amino acid 
sequence SEQ ID NO: 6050 wherein said fragment does not include the N-glycosylation site 
identified above. The invention includes a polynucleotide encoding such a firagment. 

T-epitopes for SEQ ID NO: 6050 are identified in Table 24. The invention includes a 
polypeptide for use as an antigen, wherein the polypeptide comprises: (a) an amino acid 
sequence selected firom the group consisting of the T-epitope sequences identified in SEQ ID 
NOS: 9438-9538; (b) an amino acid sequence having sequence identity to an amino acid 
sequence of (a). The invention further comprising a polynucleotide sequence encoding the 
polypeptides of (a) or (b). The invention further comprising a method of expression or delivery 
of such polynucleotides through viral vectors and/or viral particles. The invention further 
comprises a polypeptide comprising two or more of the T-epitope sequences identified in SEQ 
ID NOS: 9438-9538, or a polynucleotide encoding such a polypeptide. 

The use as an antigen is preferably a use: (1) as a T-cell antigen; (2) for generating a 
complex between a class I MHC protein (e.g. a class I HLA) and a fragment of said antigen; (3) 
as an antigen for raising a ceU-mediated immune response; and/or (4) as an antigen for raising a 
CTL response. The use preferably protects or treats disease and/or infection caused by a SARS 
virus. The invention provides the use of a polypeptide in the manufacture of a medicament for 
inomunising a mammal (typically a human) against SARS viral infection wherein the polypeptide 
is as defined above. 

The invention provides a method of raising an immune response in a mammal (typically a 
human), comprising the step of administering to the mammal a polypeptide as defined above, 
wherein said immune response is a cell-mediated inoutnune response and, preferably, a CTL 
response. The immune response is preferably protective or therapeutic. 

The invention includes a polypeptide sequence comprising SEQ ID NO: 6051 or a 
fragment thereof or an amino acid sequence having sequence identity thereto. The invention 
includes a polypeptide sequence comprising SEQ ID NO: 6052 or a fiagment thereof or an 
amino acid sequence having sequence identity thereto. 

SEQ ID NO: 6051 and SEQ ID NO: 6052 demonstrate functional homology with a 
nucleocapsid protein of a coronavirus. The invention includes a diagnostic kit comprising a 
polypeptide comprising SEQ ID NO: 6051, SEQ ID NO: 6052 or a fragment thereof. The 
invention includes a diagnostic kit comprising a polynucleotide encoding SEQ ID NO: 6051, 
SEQ ID NO: 6052 or a fragment thereof. The invention includes an immunogenic composition 
comprising SEQ ID NO: 6051, SEQ ID NO: 6052 or a fragment thereof. The invention includes 
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an antibody which recognizes a polypeptide comprising SEQ ID NO: 6051, SEQ ID NO: 6052 
or a fragment thereof. 

SEQ ID NO: 605 1 is predicted to be phosphorylated at Ser-79; Thr-92; Ser-106; Thr-1 16; 
Thr-142; Ser-184; Ser-188; Ser-202; Ser-236; Thr-248; Ser-251; Ser-256; Thr-377. 
5 Accordingly, the invention includes a polypeptide comprising a fragment of SEQ ID NO: 6051 
wherein said fragment includes one or more of the amino acid residues of SEQ ID NO: 6051 
selected from the group consisting of Ser-79; Thr-92; Ser-106; Thr-116; Thr-142; Ser-184; Ser- 
188; Ser-202; Ser-236; Thr-248; Ser-251; Ser-256; Thr-377. The invention farther includes a 
polypeptide comprising a fragment of SEQ ID NO: 6051 wherein said fragment does not include 

10 one or more of the apoino acid residues of SEQ ID NO: 6051 selected from the group consisting 
of Ser-79; Thr-92; Ser-106; Thr-116; Thr-142; Ser-184; Ser-188; Ser-202; Ser-236; Thr-248; 
Ser-251; Ser-256; Thr-377. Two further useful fragments of the N protein {e.g. for 
immunoassay) are SEQ ID NOS: 9783 & 9784, which are lysine-rich and can be used to 
distinguish the SARS virus from other coronaviruses. 

15 Predicted transmembrane regions of SEQ ID NO: 6051 are identified below. 

Inside to. outside helices :. 1 found 
from to score center 

304 { 304) 323 ( 319) 495 312 

20 Outside to inside helices : 2 found 

from to score center 

304 ( 304) 319 ( 319) 597 312 ' 

Accordingly, the invention includes a polypeptide comprising a fragment of SEQ ID NO: 
25 6051 wherein said fragment does not include one or more of the hydrophobic amino acid 

sequences identified above. The invention also includes a polynucleotide sequence encoding any 
of the above-identified polypeptides. 

Predicted protein localization of SEQ ID NO: 6051 is set forth below. SEQ ID NO: 6051 
is predicted to be localized near the nucleus, lysosome (lumen), mitochondrial matrix space, and 
30 microbody (peroxisome). The highest ranking is for localization near the nucleus. Coronavirus 
nucleocapsid proteins are known to bind to viral RNA. Coronavirus nucleocapsid proteins are 
also thought to be important for cell mediated immunity. Accordingly, the invention includes a 
polynucleotide comprising SEQ ID NO: 6051. The invention fiirther includes a viral vector or 
particle suitable for in vivo delivery of the polynucleotide sequence comprising a SARS virus 
35 nuceocapsid polynucleotide sequence or a fragment thereof. In one embodiment, the 

polynucleotide comprises SEQ ID NO: 6051 or a fragment thereof. The invention further 
includes a method for eliciting a cell mediated immune response comprising delivering a 
polynucleotide encoding a SARS vims nucleocapsid protein or a fragment thereof to a mammal. 
In one embodiment, the polynucleotide comprising SEQ ID NO: 6051 or a fragment thereof. 
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The invention further includes a method of screening SEQ ID NO: 6051 for an inhibitor. 

The invention includes the recombinant expression of SEQ ID NO: 6051 in a host cell. The 

invention includes a small molecule which prevents the polypeptide of SEQ ID NO: 6051 from 

binding to SARS virus RNA during viral replication. The invention includes a fusion protein 

wherein said fusion protein comprises SEQ ID NO: 6051. PrecKcted protein localization of SEQ 

ID NO: 6051 is set forth below. 

PSORT — Prediction of Protein Localization Sites 

c^^^ . , . ^ . . version 6 . 4 (WWW) 

Species classification: 4 

*** Reasoning Step: 1 



Preliminary Calculation of ALOM (threshold: 0.5) 
cotmt : 0 

15 McG: Examining signal sequence (McGeoch) 

Length of UR: 3 
Peak Value of UR: 0.19 
Net Charge of CR: 0 
Discriminant Score: -15.98- 
GvH: Examining signal sequence (von Heijne) 
Signal Score (-3.5): -6.36 
Possible cleavage site: 58 
>» Seems to have no N- terminal signal seq. 
Amino Acid Composition of Predicted Mature Form- 
• calculated from 1 

ALOM new cnt: 0 ** thrshld changed to -2 
Cleavable signal was detected in ALOM?: OB 
ALOM: finding transmembrane regions (Klein at al > 
count: 0 value: 5.04 threshold: -2.0 
PERIPHERAL Likelihood = 5.04 
modified ALOM score: -1.91 
. Gavel: Ex^ining the boundary of mitochondrial targeting seq. 
motif at: 17 
PRITFG 

35 Discrimination of mitochondrial target seq • 

negative (-3.97) 

*** Reasoning -Step: 2 

W KDEL Count: 0 

Checking apolar signal for intramitochondrial sorting 

Mitochondrial matrix? Score: 0,10 

SKL motif (signal for peroxisomal protein): 

Pos: -1(399), count: 0 
V5 Amino Acid Composition Tendency for Peroxisome: 0.04 

Peroxisomal proteins? Status: notclr 

AAC score (peroxisome): 0.072 
Amino Acid Composition tendency for lysosomal proteins 

score: 0.96 Status: notclr 
»0 Modified score for lysosome: 0.246 

Checking the amount of Basic Residues (nucleus) 
Checking the 4 residue pattern for Nuclear Targeting 

Foxind: pos: 256 (4) KKPR 

Found: pos: 372 (5) KKKK 
Checking the 7 residue pattern for Nuclear Targeting 
Checking the Robbins & Dingwall consensus (nucleus) 

Found: pos: 372 (3) KK KKTDEAQPLP QRQKK 
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Foiind: pos: 373 (3) KK KTDEAQPLPQ RQKKQ 
Pinal Robbins Score (nucleus) : 0.80 

Checking the RNA binding motif (nucleus or cytoplasm) 

nuc modified. Score: 0.90 

Nuclear Signal Status: positive ( 0.90) 

Checking CaaX motif.. 

Checking N-myristoylation. . 

Checking CaaX motif . . 

Pinal Resull:s 

nucleus Certainty= 0.980 (Affirmative) < suco 

lysosome (lumen) Certainty= 0 .246 (Affirmative) < suco 

mitochondrial matrix space — Certaxnty= 0 . 100 (Affirmative) < suco 
microbody (peroxisome) Certainty= 0 . 072 (Affirmative) < suco 

Predicted N-glycosylation sites of SEQ ID NO: 6051 are identified below. 
Position Potential Jury NGlyc 
agreement result 
48 NNTA 0.6879 (9/9) ++ (SEQ ID NO: 7270) 

270 NVTQ 0.7684 (9/9) +++ (SEQ ID NO: 7271) 

Residue No. Potential Threshold Assignment 
Thr lee 0.8547 0.6439 T 

Thr 367 0.5575 0.5403 T 

Thr 394 0.8217 0.5821 T 

Accordingly, the invention comprises a polypeptide comprising a fragment of the amino 
acid sequence of SEQ ID NO: 605 1 wherein said fragment comprises one or more of the N- 
glycosylation sites identified above. The invention further comprises a polynucleotide encoding 
one or more of the polypeptides identified above. 

The invention further comprises a polypeptide comprising a fragment of amino acid 
sequence SEQ ID NO: 6051 wherein said fragment does not include one or more of the 
N-glycosylation sites identified above. The invention includes a polynucleotide encoding such a 
firagment. 

T-epitopes for SEQ ID NO: 6052 are identified in Table 25. The invention includes a 
polypeptide for use as an antigen, wherein the polypeptide comprises: (a) an amino acid 
sequence selected from the group consisting of the T-epitope sequences identified in SEQ ID 
NOS: 9539-9752; (b) an amino acid sequence having sequence identity to an amino acid 
sequence of (a). The invention further comprising a polynucleotide sequence encoding the 
polypeptides of (a) or (b). The invention further comprising a method of expression or delivery 
of such polynucleotides through viral vectors and/or viral particles. The invention fiirther 
comprises a polypeptide comprising two or more of the T-epitope sequences identified in SEQ 
ID NOS: 9539-9752, or a polynucleotide encoding such a polypeptide. 

A variant of SEQ ID NO: 6052 that is included within the invention is SEQ ID NO: 9964. 
Compared to SEQ ID NO: 6052, this sequence has lie at residue 54 instead of Thr. 
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The use as an antigen is preferably a use: (1) as a T-cell antigen; (2) for generating a 
complex between a class I MHC protein {e.g. a class I HLA) and a fragment of said antigen; (3) 
as an antigen for raising a cell-mediated immune response; and/or (4) as an antigen for raising a 
CTL response. The use preferably protects or treats disease and/or infection caused by a SARS 
virus. The invention provides tiie use of a polypeptide in the manufacture of a medicament for 
immunising a mammal (typically a human) against SARS viral infection wherein tiie polypeptide 
is as defined above. 

The invention provides a method of raising an immune response in a mammal (typically a 
human), comprising the step of administering to the mammal a polypeptide as defined above, 
wherein said inmiune response is a cell-mediated immune response and, preferably, a CTL 
response. The immune response is preferably protective or therapeutic. 

The invention includes a composition comprising a SARS virus nucleocapsid protein or a 
fragment tiiereof and further comprising a SARS virus membrane protein or a fragment thereof. 
The composition may further comprising one or more adjuvants discussed below. 

The invention further includes a composition comprising a polypeptide comprising SEQ 
ID NO: 6051 or a fragment tiiereof or a sequence having sequence identity thereto and further 
comprising a polypeptide comprising SEQ ID NO: 6040, or a fragment tiiereof or a sequence 
having sequence identity thereto. Such composition may be used, for instance, in a vaccine. 
Such composition may further comprise one or more adjuvants discussed below. 

The invention includes a composition comprising a SARS virus nucleocapsid protein or a 
fragment thereof and a SARS virus spike protein or a fragment thereof. In one embodiment tiie 
nucleocapsid protein comprises a polypeptide sequence comprising SEQ ID NO: 6051 or a 
fragment thereof or a sequence having sequence identity thereto. In one embodiment, the spike 
protein comprises a polynucleotide comprising SEQ ID NO: 6042 or a fragment tiiereof or a 
sequence having sequence identity tiiereto. The composition may further comprise one or more 
of the adjuvants discussed below. 

The invention further includes a composition comprising antibodies specific to a SARS 
virus nucleocapsid protein and comprising antibodies specific to a SARS virus spike protein. In 
one embodiment tiie antibody is specific to a nucleocapsid protein comprises a polypeptide 
sequence comprising SEQ ID NO: 6051 or a fragment thereof or a sequence having sequence 
identity thereto. In one embodiment, tiie antibody is is specific to a spike protein comprises a 
polynucleotide comprising SEQ ID NO: 6042 or a fragment thereof or a sequence having 
sequence identity thereto. 

The invention further includes polynucleotide sequences, and fragments tiiereof, of a 
SARS virus which are conserved among coronaviruses, and polypeptides encoded thereby. Such 
conserved sequences can be identified in the alignments shown in HGURE 7. Such conserved 
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sequences may be used in the vaccines of the invention or in the diagnostic reagents, kits and 
methods of the invention. 

The invention further includes polynucleotide sequences, and fragments thereof, of a 
SARS virus which are specific to SARS virus and not shared with coronaviruses. Such SARS 
specific sequences are also identified as SEQ K) NOS: 6040, 6043, 6044, 6047, 6048, 6049 and 
6050. Such SARS specific sequences may be used in the vaccines of the invention or in die 
diagnostic reagents, kits and methods of the invention. 

The invention also includes polynucleotide sequences which can be used as probes or 
primers for diagnostic reagents, kits (comprising such reagents) and methods which can be used 
to diagnose or identify the presence or absence of a SARS virus in a biological sample. The 
invention includes a polynucleotide sequence comprising one or more of the primer sequences 
identified inSEQ ED NOS: 6076-6265 (Table 5). The invention further includes polynucleotide 
sequence comprising the complement of one or more of the primer sequences identified in SEQ 
ID NOS: 6076-6265. 

The invention also includes polynucleotide sequences which can be used as probes or 
primers for diagnostic reagents, kits (comprising such reagents) and methods which can be used 
to diagnose or identify the presence or absence of a SARS viias in a biological sample. The 
invention mcludes a polynucleotide sequence comprising one or more of the primer sequences 
identified in SEQ ID NOS: 6266-6343 (Table 6). The invention further includes polynucleotide 
sequence comprising the complement of one or more of the primer sequences identified in SEQ 
ID NOS: 6266-6343. 

The invention also includes polynucleotide sequences which can be used as probes or 
primers for diagnostic reagents, kits (comprising such reagents) and methods which can be used 
to diagnose or identify the presence or absence of a SARS virus in a biological sample. The 
invention includes a polynucleotide sequence comprising one or more of the primer sequences 
identified in SEQ ID NOS: 6344-6392 (Table 7). The invention further includes polynucleotide 
sequence comprising the complement of one or more of the primer sequences identified in SEQ 
ID NOS: 6344-6392.. 

The invention also includes polynucleotide sequences which can be used as probes or 
primers for diagnostic reagents, kits (comprising such reagents) and methods which can be used 
to diagnose or identify the presence or absence of a SARS virus in a biological sample. The 
invention includes a polynucleotide sequence comprising one or more of the primer sequences 
identified in SEQ ID NOS: 6393-6559 (Tables 8 & 9). The invention further includes 
polynucleotide sequence comprising the complement of one or more of the primer sequences 
identified in SEQ ID NOS: 6393-6559. 
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The invention also includes polynucleotide sequences which can be used as probes or 
primers for diagnostic reagents, kits (comprising such reagents) and methods which can be used 
to diagnose or identify tiie presence or absence of a SARS virus in a biological sample. The 
invention includes a polynucleotide sequence comprising one or more of tiie primer and probe 
sequences identified in SEQ ID NOS: 6560-6568. The invention further includes polynucleotide 
sequence comprising the complement of one or more of tiie primer sequences identified in SEQ 
ID NOS: 6560-6568. 

The invention includes a polypeptide sequence comprising any one of even-numbered SEQ 
ID NOS: 7272-7290, or a fragment tiiereof, or a sequence having sequence identity tiiereto. The 
invention further includes a polynucleotide sequence encoding any one of even-numbered SEQ 
ID NOS: 7272-7290. or a fragment thereof, or a sequence having sequence identity thereto. 
Examples of such polynucleotide sequences are odd-numbered SEQ ID NOS: 7273-7291. 

The invention includes a polynucleotide sequence comprising an intergenic sequence 
which is common to each open reading frame of the SARS virus. The SARS vims is tiiought to 
use this sequence to signal translation of tiie open reading frame. The intergenic sequence 
comprises a lOmer SEQ ID NO: 7292, or optionally a hexamer SEQ ID NO: 7293. When the .. 
virus transcribes its positive (+) RNA strand to (-) RNA sti^d, tfie virus lepUcating stmctuie 
uses the (-) strand template to transcribe nucleotides at tiie 5' end prior to tiie first intergenic 
sequence, followed by tiie intergenic sequence, followed by tfie selected open reading frame. 
The virus tiien creates multiple mRNAs comprising tiie 5' end, tiie intergenic sequence and 
coding sequence. For more details on Nidovriales replication (including Coronavirus) see e.g., 
Ziebuhr et al., "Virus-encoded proteinases and proteolytic processing in tiie Nidovirales", 
Journal of General Virology 81:853-879 (2000), incorporated herein by reference in its entirety. 

The invention comprising a polynucleotide sequence comprising SEQ ID NO: 7292 or tiie 
complement tiiereof. The invention comprising a polynucleotide sequence comprising SEQ ID 
NO: 7293 or the complement thereof. The invention further comprises a polynucleotide 
sequence comprising nucleotides from tiie 5' end of tiie SARS viral genome, or its reverse 
complement, and further comprising an intergenic sequence or its reverse complement. The 
polynucleotide may further comprise one or more of tiie SARS virus open reading frames. 
Examples of polynucleotide sequences comprising nucleotides from tiie 5' end of the SARS virus 
genome followed by tfie intergenic sequence are SEQ ID NOS: 7294-7301.- 

The invention includes a polynucleotide sequence comprising a sequence selected from tiie 
group consisting of SEQ ID NO: 7292, SEQ ID NO: 7293, SEQ ID NO: 7294, SEQ ID NO: 
7295. SEQ ID NO: 7296, SEQ ID NO: 7297. SEQ ID NO: 7298, SEQ ID NO: 7299, SEQ ID 
NO: 7300 and SEQ ID NO: 7301, or a fragment tiiereof, or a sequence having sequence identity 
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thereto. In one embodiment, the polynucleotide does not consist entirely of a known S ARS virus 
sequence. 

The SARS virus intergenic sequence can be used to create a RNAi molecule. Such a 
SARS virus specific RNAi molecule can be used to treat SARS virus infection. The invention 
includes a RNAi molecule comprising a double stranded RNA molecule wherein one RNA 
strand comprises a sequence selected from the group consisting of SEQ ID NO: 7292, SEQ ID 
NO: 7293, SEQ ID NO: 7294, SEQ ID NO: 7295, SEQ ID NO: 7296, SEQ ID NO: 7297, SEQ 
ID NO: 7298, SEQ ID NO: 7299, SEQ ID NO: 7300 and SEQ ID NO: 7301 , or a fragment 
thereof. Preferably, said RNA strand comprises a sequence selected from the group consisting of 
SEQ ID NO: 7292 and SEQ ID NO: 7293. Preferably, the other RNA strand comprises the 
reverse complement of the first strand or a polynucleotide sequence which hybridizes to the first 
strand. 

The invention includes the use of RNAi in a method of treatment for SARS virus infection 
comprising administering to a mannmal an effective amount of the si RNA molecule. Preferably, 
the RNAi molecule comprises the molecule described above. Further discussion of the RNAi 
applications of the intergenic sequence is included in section IV of the specification below. 

The invention also includes the use of a SARS virus antisense nucleotide sequence, 
preferably antisense directed to the SARS virus intergenic sequence. Such an antisense sequence 
may be used in the treatment of a subject infected with the SARS virus. The antisense of the 
SARS virus intergenic sequence can be designed to bind to the SARS viral polynucleotides to 
block access of the viral replication machinery to the intergenic sequence. Such an antisense 
sequence may also be used to identify the presence or absence of a SARS virus in a biological 
sample. The antisence can itself be labeled or the antisense associated with viral polynucleotides 
can be detected by means known in the art. 

Antisense nucleic acids are designed to specifically bind to RNA, resulting in the 
formation of RNA-DNA or RNA-RNA hybrids, with an arrest of DNA replication, reverse 
transcription or messenger RNA translation. Antisense polynucleotides based on a selected 
sequence can interfere with expression of the corresponding gene. Antisense x)olynucleotides 
will bind and/or interfere with the translation of the corresponding wKNA. 

The invention also includes the use of the intergenic region with a ribozyme. 

Trans-cleaving catalytic RNAs (ribozymes) are RNA molecules possessing 
endoribonuclease activity. Ribozymes are specifically designed for a particular target, and the 
target message must contain a specific nucleotide sequence. They are engineered to cleave any 
RNA species site-specifically in the background of cellular RNA. The cleavage event renders the 
mRNA unstable and prevents protein expression. Importantly, ribozymes can be used to inhibit 
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expression of a gene of unknown function for the purpose of determining its function in an in 
vitro or in vivo context, by detecting the phenotypic effect. 

One commonly used ribozyme motif is the hammerhead, for which the substrate sequence 
requirements are minimal. Design of the hammerhead ribozyme is disclosed in Usman et al. 
Current Opin. Struct. Biol (1996) 6:527-533. Usman also discusses the therapeutic uses of 
ribozymes. Ribozymes can also be prepared and used as described in Long et al, FASEB J. 
(1993) 7:25; Symons, Ann. Rev. Biochem. (1992) 61:641; Penotta et al., Biochem. (1992) 31:16- 
17; Ojwang et al, Proc. Natl Acad. Sci. (USA) (1992) 89:10802-10806; and US Patent 
5,254,678. Ribozyme cleavage of HIV-I RNA is described in US Patent 5,144,019; methods of 
cleaving RNA using ribozymes is described in US Patent 5,116,742; and methods for increasing 
the specificity of ribozymes are described in US Patent 5,225,337 and Koizumi et al. Nucleic 
Acid Res. (1989) 17:7059-7071. Preparation and use of ribozyme fragments in a hammerhead 
structure are also described by Koizumi et al. Nucleic Acids Res. (1989) 17:7059-7071.: 
Preparation and use of ribozyme fragments in a hairpin structure are described by Chowrira & 
Burke, Nucleic Acids Res. (1992) 20:2835. Ribozymes can also be made by rolling transcription 
as described inDaubendiek & Kool, Nat. Biotechnol (1997) 15(3):273-277. . 

The hybridizing region of the ribozyme may be modified or may be prepared as a branched 
structure as described in Horn & Urdea, Nucleic Acids Res. (1989) 17:6959-67. The basic 
structure of the ribozymes may also be chemically altered in ways familiar to those skilled in the 
art, and chemically synthesized ribozymes can be administered as synthetic oligonucleotide 
derivatives modified by monomeric units. In a therapeutic context, liposome mediated delivery 
of ribozymes improves cellular uptake, as described in Birikh et al, Eur. J. Biochem. (1997) 
245:1-16. 

Therapeutic and functional genomic applications of ribozymes proceed beginning with 
knowledge of a portion of the coding sequence of the gene to be inhibited. In the present 
invention, the target sequence preferably comprises the intergeneic sequence of the SARS virus. 
Preferably, the sequence is selected from the group consisting of SEQ ID NO: 7292 and SEQ ID 
NO: 7293. A target cleavage site is selected in the target sequence, and a ribozyme is 
constructed based on the 5' and 3* nucleotide sequences that flank the cleavage site. Preferably, 
the 5' nucleotide sequence includes the 5' untranslated region of the SARS virus. The ribozyme 
may then further be constructed from one or more of the polynucleotide sequences selected from 
the group consisting of SEQ ID NO: 7294, SEQ ID NO: 7295, SEQ ID NO: 7296, SEQ ID NO: 
7297, SEQ ID NO: 7298, SEQ ID NO: 7299, SEQ ID NO: 7300 and SEQ ID NO: 7301. 

Antisense treatment of HIV infection is described in the following references, each of 
which is incorporated herein by reference in their entirety, (antisense RNA complementary to 
the mRNA of gag, tat, rev, env) (Sezakiel et al, 1991, J. Virol. 65:468-472; Chatterjee et al., 
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1992, Science 258:1485-1488; Rhodes et al, 1990, J. Gen. Virol. 71:1965. Rhodes et al., 1991, 
AIDS 5:145-151; SezaMel et al, 1992, J. Virol. 66:5576-5581; Joshi et al, 1991. J. Virol. 
65:5524-5530). 

The invention includes the use of decoy RNA to disrupt the SARS virus replication and 
5 life cycle. Methods of making and using such decoy RNA for treatment of a viral infection are 
known in the art. The invention includes delivery of genes encoding, for example, the SARS 
virus intergenic sequence, to infected cells. Preferably, the sequence comprises one or more of 
the sequences selected from the group consisting of SEQ ID NO: 7292, SEQ ID NO: 7293. SEQ 
ID NO: 7294. SEQ ID NO: 7295, SEQ ID NO: 7296. SEQ ID NO: 7297, SEQ ID NO: 7298, 

10 SEQ ID NO: 7299, SEQ ID NO: 7300 and SEQ ID NO: 7301. Preferably, the sequence 

comprises one or more of the sequences selected from the group consisting of SEQ ID NO: 7292 
and SEQ ID NO: 7293. Preferably, the sequence comprises SEQ ID NO: 7293. 

In the present invention, delivery of intergenic sequence which is not linked to the SARS 
virus open reading frames disrupts the translation process of the viral RNA and decreases the 

15 production of vial proteins. Similar methods of treatment for HIV viral infection have been 
described. The following references discuss the use of decoy RNA of HIV TAR or RRE for 
treatment of HTV infection. Each of these references is incorporated herein by reference in their 
entirety. (SuUenger et al, 1990, Cell 63:601-608; Sullenger et al, 1991, J. Virol. 65:6811-6816; 
lisziewicz et al., 1993, New Biol. 3:82-89; Lee et al, 1994, J. Virol. 68:8254-8264), ilbozymes ' 

20 (Sarver et al, 1990, Science 247:1222-1225; Wecrasinghe et al, 1991, J. Virol. 65:5531-5534; : 
DropuUc et al, 1992, J. Virol. 66:1432-1441; Ojwang et al, 1992, Proc. Natl. Acad. Sci. USA. 
89:10802-10806; Yu et al., 1993, Proc. Natl. Acad. Sci. USA. 90:6340-6344; Yu et aZ., 1995, 
Proc. Natl. Acad. Sci. USA. 92:699-703; Yamada et al, 1994, Gene Therapy 1:38-45). 
The invention includes the use of the SARS virus intergenic sequence in diagnostic 

25 reagents, kits (comprising such reagents) and methods which can be used to diagnose or identify 
the presence or absence of a SARS virus in a biological sample. Such diagnostic reagents, kits, 
and methods are further discussed in Section II of the speciHcation. 

The invention includes a pair of primers for amplifying a SARS polynucleotide sequence 
comprising (i) a first primer comprising a sequence which is substantially identical to a portion 

30 of a sequence selected from the group consisting of SEQ ID NO: 7292, SEQ ID NO: 7293, SEQ 
ID NO: 7294, SEQ ID NO: 7295, SEQ ID NO: 7296, SEQ ID NO: 7297, SEQ ID NO: 7298, 
SEQ ID NO: 7299, SEQ ID NO: 7300 and SEQ ID NO: 7301 and (ii) a second primer 
comprising a sequence which is substantially complementary to a portion of a sequence selected 
from the group consisting of the sequence SEQ ID NO: land the sequence SEQ ID NO: 2, such 

35 that the primer pair (i) and (ii) defines a template sequence within a sequence from the group 
consisting of the sequence SEQ ID NO: land the sequence SEQ ID NO: 2. Preferably, the (i) 
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first primer comprises a sequence which is substantially identical to a portion of a sequence 
selected from the group consisting of SEQ ID NO: 7292 and SEQ ID NO: 7293. Preferably, the 
(i) first primer comprises a sequence which is substantially identical to a portion of the sequence 
of SEQ ID NO: 7293. The amplicon defined by said first and second primers is preferably 
5 between 50 and 250 nucleotides in length. The primers may optionally be labeled to facilitate 
their detection. Methods and compositions for use in labeling primers are discussed further in 
the application in Section m. 

The invention further includes a pair of primers for amplifying a SARS polynucleotide 
sequence comprising (i) a first primer comprising a sequence which is substantially identical to a 

10 portion of the complement of a portion of a sequence selected from the group consisting of SEQ 
ID NO: 7292, SEQ ID NO: 7293. SEQ ID NO: 7294, SEQ ID NO: 7295, SEQ ID NO: 7296, 
SEQ ID NO: 7297, SEQ ID NO: 7298, SEQ ID NO: 7299, SEQ ID NO: 7300 and SEQ ID NO: 
7301 and (ii) a second primer comprising a sequence which is substantially complementary to a ■ 
portion of the complement of a sequence selected from the group consisting of the sequence SEQ 

15 ID NO: land the sequence SEQ ID NO: 2, such that the primer pair defines a template sequence . 
within a sequence selected from the group consisting of the sequence SEQ ID NO: 1 and the 
sequence SEQ ID NO: 2. The amplicon defined by said first and second primers is preferably 
between 50 and 250 nucleotides in length. The primers may optionally be labeled to facilitate 
their detection.. Methods and compositions for use in labeling primers are discussed further in 

20 the application in Section in. 

. The invention includes a kit comprising (i) a first primer comprising a sequence which is 
substantially identical to a portion of a sequence selected from the group consisting of SEQ ID 
NO: 7292, SEQ ID NO: 7293, SEQ ID NO: 7294, SEQ ID NO: 7295. SEQ ID NO: 7296, SEQ 
ID NO: 7297, SEQ ID NO: 7298, SEQ ID NO: 7299, SEQ ID NO: 7300 and SEQ ID NO: 7301 

25 and (ii) a second primer comprising a sequence which is substantially complementary to a 

portion of a sequence selected from the group consisting of the sequence SEQ ID NO: 1 and the 
sequence SEQ ID NO: 2, such that the primer pair (i) and (ii) defines a template sequence within 
a sequence from the group consisting of SEQ ID NO: 1 and SEQ ID NO: 2. Preferably, the (i) 
first primer comprises a sequence which is substantially identical to a portion of a sequence 

30 selected from the group consisting of SEQ ID NO: 7292 and SEQ ID NO: 7293. Preferably, the 
(i) first primer comprises a sequence which is substantially identical to a portion of the sequence 
of SEQ ID NO: 7293. The primers may optionally be labeled to facilitate their detection. 
Methods and compositions for use in labeling primers are discussed further in the application in 
Section m. 

35 Other preferred kits comprise (i) a first primer comprising a sequence which is 

substantially identical to a portion of the complement of a portion of a sequence selected from 
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the group consisting of SEQ ID NO: 7292, SEQ ID NO: 7293, SEQ ID NO: 7294, SEQ ID NO: 
7295, SEQ ID NO: 7296, SEQ ID NO: 7297, SEQ ID NO: 7298, SEQ ID NO: 7299, SEQ ID 
NO: 7300 and SEQ ID NO: 7301 and (ii) a second primer comprising a sequence which is 
substantially complementary to a portion of the complement of a sequence selected from the 
5 group consisting of SEQ ID NO: 1 and SEQ ID NO: 2, such that the primer pair defines a 

template sequence within a sequence selected from the group consisting of SEQ ID NO: 1 and 
SEQ ID NO: 2. 

The invention further includes an attenuated S ARS virus for use as a vaccine wherein the 
intergenic region has been mutated to reduce expression of the viral structural or nonstructural 

10 proteins. The attenuated SARS virus may comprises one or more additions, deletions or 

insertion in one or more of the intergenic regions of the viral genome. Preferably, the attenuated 
SARS virus comprises an addition, deletion or insertion in one or more occurrences of the 
sequence selected from the group consisting of SEQ ID NO: 7292 and SEQ ID NO: 7293. 
Preferably, the addition, deletion or insertion occurs in one or more occurrences of SEQ ID NO: 

15 7293. 

The invention further comprises a small molecule which inhibits binding or association of 
the SARS viral replication machinery, such as a ribonucleoprotein, with the intergenic region of 
the viral genome. Preferably, the small molecule inhibits binding or association of the SARS 
viral machinery with a sequence selected from the group consisting of SEQ ID NO: 7292 and 

20 SEQ ID NO: 7293. Preferably, the small molecule inhbiits binding or association of the SARS 
viral machinery with SEQ ID NO: 7293. The invention further includes a method of screening 
for a small molecule for treatment of SARS viral infection comprising using an assay to identify 
a small molecule which interferes with the association of the SARS viral rephcation machinery 
with the intergenic region of the SARS viral genome. 

25 The invention further provides a novel SARS polynucleotide sequence SEQ ID NO: 9968. 

All six reading frames of this 690mer sequence are shown in Figure 113. The constituent amino 
acid sequences from Figure 113, having at least 4 amino acids, are listed as SEQ ID NOS: 9969 
to 10032. 

Accordingly the invention includes a polynucleotide sequence comprising SEQ ID NO: 
30 9968. It also provides polynucleotide sequences having sequence identity to SEQ ID NO: 9968. 
The degree of sequence identity is preferably greater than 50% (e.g., 60%, 70%, 80%, 85%, 
88%, 90%, 92%, 95%, 99% or more). 

The invention includes an amino acid sequence encoded by the polynucleotide sequence of 
SEQ ID NO: 9968, including the amino acid sequences selected from the group consisting of 
35 SEQ ID NO^: 9969 to 10032. Preferably, the amino acid sequence comprises SEQ ID NO: 9997 
or comprises SEQ ID NO: 9998. 
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The invention also pro>ddes amino acid sequences having sequence identity to an amino 
acid sequence encoded by SEQ BO NO: 9968. The invention provides amino acids having 
sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO^: 
9969 to 10032. The degree of sequence identity is preferably greater than 50% (e.g., 60%, 70%, 
80%, 85%, 88%, 90%, 92%, 95%, 99% or more). 

A portion of SEQ ID NO: 9968 matches with approximately 98% identity to a previously 
published SARS polynucleotide sequence, commonly referred to as 'TBNI-l" (SEQ ID NO: 
10033). BNI-1 was sequenced at Bemhard Nocht Listimte for Tropical Medicine, National 
Reference Center for Tropical Infectious Diseases in Hamburg, Geraiany. The BNI-1 sequence 
was pubUshed on the WHO website on April 4, 2003 at http://www.who.mt/csr/sars/primers/en and in 
Dorsten et al, "Identification of a Novel Coronaviras in Patients with Severe Acute Respiratory 
Syndrome". New England Journal of Medicine, published onUne at http://www.nejm.org on April 
10, 2003. Both references are incorporated herein by reference in their entirety. The six reading 
frames of this 302mer sequence are shown in Figure 114 (see also Figure 129). The constituent 
amino acid sequences from Figure 114, having at least 4 amino acids, are Hsted as SEQ ID NO^: 
10034 to 10065. An aHgnment of SEQ ID NO: 10034 with SEQ ID NO: 9997 is shown in Figure 

130: 

The invention provides for polynucleotide sequences comprising fragments of SEQ ID 
NO: 9968. In one embodiment, the fragment does not consist entirely of SEQ ID NO: 10033 or 
of a known coronavirus. 

;The invention provides for amino acid sequences comprising fragments of an amino acid 
sequence encoded by SEQ ID NO: 9968. In one embodiment, the fragment does not consist 
entirely of an amino acid sequence encoded by SEQ ID NO: 10033 or a known coronavirus. 

The invention provides for amino acids comprising fragments of an amino acid sequence 
selected from the group consisting of SEQ ID NO^: 9969 to 10032. In one embodiment, the 
fragment does not consist entirely of an amino acid sequence encoded by die polynucleotide 
sequence of SEQ ID NO: 10033 or a known coronavirus. 

Approximately 100 nucleotides at the 5' end of SEQ ID NO: 9968 do not match any 
portion of die BNI-1 polynucleotide sequence (SEQ ID NO: 10033). This unmatched portion is 
set forth as SEQ ID NO: 10066. The invention tiius further provides a polynucleotide comprising 
the sequence comprising SEQ ID NO: 10066. polynucleotide sequences having sequence 
identity to SEQ ID NO: 10066, or polynucleotide sequences comprising fragments of SEQ ID 
NO: 10066. 

The invention further comprises an amino acid sequence encoded by SEQ ID NO: 10066, 
an amino acid sequence having sequence identity to an amino acid sequence encoded by SEQ ID 
NO: 10066, or an amino acid sequence comprising fragments of an amino acid sequence 
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encoded by SEQ ID NO: 10066. Preferably, the amino acid sequence comprises SEQ ID NO: 
10067. 

SEQ ID NO: 9997/9998 demonstrates homology with the a region of pollab of several 
coronaviruses. FIGURE 115 shows an alignment of SEQ ID NO^: 9997/9998 to amino acid 
5 sequences for pollab of bovine coronavims (SEQ ID NO: 10068), avian infectious bronchitis 
' virus (SEQ ID NO: 10069) and murine hepatitis virus (SEQ ID NO: 10070). A consensus amino 
acid sequence of SEQ ID NO^: 9997/9998, SEQ ID NO: 10068, SEQ ID NO: 10069, and SEQ 
ID NO: 10070 is shown in the bottom row of the alignment in Figure 115 {e.g. SEQ ID NO: 
10071). 

10 As shown in FIGURE 113, the polynucleotide sequence encoding SEQ ID NO: 9997 has a 

stop codon after codon 205, between SEQ ID NO^: 9997 and 9998. Optionally, the stop codon 
can be removed and the amino acid sequence continued (SEQ ID NO: 10072). Accordingly, the 
invention provides for an amino acid sequence comprising SEQ ID NO: 9997 and/or SEQ ID 
NO: 9998, or SEQ ID NO: 10072, and further comprising an amino acid sequence encoding for 

15 the C-terminus of a coronavirus pol lab gene or a fragment thereof. 

As shown in FIGURE 115. SEQ ID NO^: 10068, 10069, 10070 and 10071 contain amino 
acids prior to the N-tenninus of SEQ ID NO: 9997. The invention also provides for an amino 
acid sequence comprising SEQ ID NO: 9997 and further comprising an amino acid sequence 
encoding for the N-terminus of a coronavirus pol lab protein or a fragment thereof. 

20 The pollab sequences on FIGURE 115 contain a coding region indicated on the 

schematic of FIGURE 1 17 by a In FIGURE 115, the beginning of this genomic region is 
designated by the arrow crossing in front of amino acid 6080 of the consensus sequence SEQ ID 
NO: 10071. The end of this genomic region is designated by the arrow crossing in front of 
amino acid 6604 of the consensus sequence. The invention provides for an amino acid sequence 

25 comprising SEQ ID NO: 9997 and/or SEQ ID NO: 9998, or SEQ ID NO: 10072, and further 

comprising a first amino sequence prior to the N-terminus of said SEQ ID NO: 9997 and/or SEQ^ 
ID NO: 9998, or SEQ ID NO: 10072, wherein said first amino acid sequence has homology to an 
N-terminus sequence of a known coronavirus pollab protein or a fragment thereof. 

The invention further provides for an amino acid sequence comprising SEQ ID NO: 9997 

30 and SEQ ID NO: 9998, wherein the stop codon after SEQ ID NO: 9971 is removed {i.e. SEQ ID 
NO: 10072), and further comprising a second amino acid sequence following the C terminus of 
SEQ ID NO: 9998, wherein said second amino acid sequence is homologous with a C terminus 
of a known coronavirus pollab protein or a fragment thereof. 

Examples of such proteins are shown aligned in FIGURE 118, and are SEQ ID NO^: 

35 10073 to 10077. SEQ ID NO: 10073 comprises SEQ ID NO: 9997 and further comprises amino 
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acids prior to the N-teiminus and subsequent to the C-terminus from the pollab "*" protein of 
avian infectious bronchitis virus. SEQ ID NO: 10074 comprises SEQ ID NO: 9997 and further 
comprises amino acids prior to the N-terminus and subsequent to the C-terminus from the 
pollab protein of bovine coronavirus. SEQ ID NO: 10075 comprises SEQ ID NO: 9997 
5 and further comprises amino acids prior to the N-terminus and subsequent to the C-tenninus 
from the pollab "*" protein of murine hepatitis virus. SEQ ID NO: 10076 comprises SEQ ID 
NO: 9997 and further comprises amino acids prior to the N-terminus and subsequent to the 
C-terminus from the consensus of the pollab "*" protein of avian infectious bronchitis virus, 
bovine coronavirus, and murine hepatitis virus (FIGURE 115). SEQ ID NO: 10077 comprises 
10 the consensus sequence of SEQ ID NOS: 10073 to 10076. 

The invention comprises an amino acid sequence selected from the group consisting of 
SEQ ID NO^: 10073, 10074, 10075, 10076 and 10077. The invention further includes an amino 
acid sequence comprising fragments of an amino acid sequence selected from the group 
consisting of SEQ ID NO^: 10073, -10074, 10075. 10076 and 10077. The invention further 
15 comprises an amino acid sequence with sequence identity to a sequence selected from the group 
consisting of SEQ ID NO^: 10073, 10074, 10075, 10076 and. 10077. 

The invention comprises polynucleotides encoding for the amino acid sequences selected 
from the group consisting of SEQ ID NO^: 10073, 10074, 10075, 10076 and 10077. The 
invention comprises polynucleotides having sequence identity to polynucleotides encoding for 
20 the amino acid sequences selected from the group consisting of SEQ ID NO^: 10073, 10074, 

10075, 10076 and 10077. The invention comprises fragments of polynucleotides encoding SEQ 
ID NO^: 10073, 10074, 10075, 10076 and 10077. 

As shown in Figure 113, SEQ ID NO: 9968 includes a sequence that encodes SEQ ID NO: 
10020 followed by a stop codon. giving a C-teiminus threonine (Thr) residue. The corresponding 
25 sequence from an amino acid sequence encoded by BNI-1 is SEQ ID NO: 10078, which 

continues past the C-terminus of SEQ ID NO: 10020. Accordingly, the invention includes a 
protein comprising amino acid sequence SEQ ID NO: 10020 or an amino acid sequence having 
sequence identity to SEQ ID NO: 10020 or an amino acid sequence comprising a fragment of 
SEQ ID NO: 10020, wherein the C-terminus residue of said protein is a threonine. Preferably, 
30 the C-terminus of said protein is -ST. Still more preferably, the C-terminus of said protein is - 
EST. The invention also includes a protein comprising amino acid sequence SEQ ID NO: 10078 
or an amino acid sequence having sequence identity to SEQ ID NO: 10078 or an amino acid 
sequence comprising a fragment of SEQ ID NO: 10078, wherein the C-terminus residue of said 
protein is Thr. Preferably, tfie C-terminus of said protein is -ST. Still more preferably, the C- 
35 terminus of said protein is -EST. 
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SEQ ID NO: 9968 also encodes a 54iner amino acid sequence SEQ ID NO: 10015. The 
polynucleotide encoding SEQ ID NO: 10015 encodes two stop codons at its C-tenninus (Rgure 
113). The corresponding region from the BNI-1 sequence does not contain this 54mer. 
Accordingly, the invention includes a protein comprising amino acid sequence SEQ ID NO: 
10015, or an amino acid sequence having sequence identity to SEQ ID NO: 10015 or an amino 
acid sequence comprising a fragment of SEQ ID NO: 10015. The invention further includes a 
polypeptide comprising SEQ ID NO: 10015 and fiirther comprising a first amino acid sequence 
prior to the N-tenninus of SEQ ID NO: 10015. 

SEQ ID NO: 9968 encodes the amino acid sequence SEQ ID NO: 9969. The 
polynucleotide sequence contains a stop codon at the C-terminus of SEQ ID NO: 9969. 
Accordingly, the invention includes a protein comprising amino acid sequence SEQ ID NO: 
9969, or an amino acid sequence having sequence identity to SEQ ID NO: 9969. The invention 
further includes a polypeptide comprising SEQ ID NO: 9969 and further comprising a first 
amino acid sequence prior to the N-terminus of SEQ ID NO: 9969. The invention further 
includes a pplypeptide comprising the sequence SEQ ID NO: 10079. 

SEQ ID NO: 9968 encodes amino acid sequence QRT (Figure 113), followed by a stop 
codpn. Accordingly, the invention includes a protein comprising amino acid sequence QRT. The 
invention further includes a polypeptide comprising amino acid sequence QRT and further 
comprising a first amino acid sequence prior to the N-terminus of the sequence QRT. 

, SEQ ID NO: 9968 encodes amino acid sequence SEQ ID NO: 10022, followed by a stop 
codon at its C-terminus. Accordingly, the invention includes a protein comprising amino acid 
sequence SEQ ED NO: 10022, or an amino acid sequence having sequence identity to SEQ ID 
NO: 10022. The invention further includes a polypeptide comprising SEQ ID NO: 10022 and 
farther comprising a first amino acid sequence prior to the N-terminus of SEQ ID NO: 10022. 

SEQ ID NO: 9968 encodes amino acid sequence SEQ ID NO: 10027. Within the SEQ ID 
NO: 10027 coding sequence there are at least three start codons, identified with underlining in 
Figure 119. The open reading firame indicated by the first start codon is SEQ ID NO: 10081. The 
open reading frame indicated by the second start codon is SEQ ID NO: 10082. The open reading 
frame indicated by the third start codon is SEQ ID NO: 10083. 

The invention provides a novel SARS polynucleotide sequence SEQ ID NO: 10084. All 
six reading frames of this 1463mer sequence are shown in Figure 120 (see also Figure 122). The 
constituent amino acid sequences from Figure 120, having at least 4 amino acids, are listed as 
SEQ ID NOS: 10085 to 10209 (see Figures 120A to 120F). 

The invention includes a polynucleotide sequence comprising SEQ ID NO: 10084. The 
invention also provides polynucleotide sequences having sequence identity to SEQ ID NO: 
10084. The invention also provides for polynucleotide sequences comprising fragments of SEQ 
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ID NO: 10084. In one embodiment, the polynucleotide fragment does not consist entirely of 

SEQ ID NO: 10033 or a known coronaviras polynucleotide sequence or a known SARS 

polynucleotide sequence. 

The invention includes an amino acid sequence encoded by the polynucleotide sequence of 
5 SEQ ID NO: 10084, including the amino acid sequences of Figures 120A to 120F e.g. selected 

from the group consisting of SEQ ID NO^: 10085 to 10209. Preferably, the amino acid sequence 

comprises SEQ ID NO: 10149. 

The invention also provides amino acid sequences having sequence identity to an amino 

acid sequence encoded by SEQ ID NO: 10084. The invention provides amino acids having 
10 sequence identity to an amino acid sequence from Figures 120A to 120F e.g. selected from the 

group consisting of SEQ ID NO^: 10085 to 10209.' 

The invention also provides fragments of amino acid sequences encoded by SEQ ID NO: 

10084. The invention also provides fragments of amino acid sequences selected from the group 

consisting of SEQ ID NO^: 10085 to 10209. Li one embodiment, the fragment does not consist 
15 entirely of an amino acid sequence encoded by SEQ ID NO: 10033 or an amino acid sequence of 

a known coronavirus or an amino acid sequence of a known SARS virus. An alignment of the 

matching portion of SEQ ID NO: 10033 and SEQ ID NO: 10084 is included in FIGURE 121. 
, In one embodiment, the invention comprises an amino acid sequence comprising SEQ ID 

NO: 10149. Ah alignment of the polynucleotide sequence SEQ ID NO: 10084 to the encoded 
20 SEQ ID NO: 10149 is shown in FIGURE 122 (5*3' Frame 3). Analysis of the 5'3' Frame 3 

translation by a computer program to predict start codon methionines OSIetStart 1 .0) (FIGURE 

123) reveals SEQ ID NO^: 10210 to 10215. 

The invention includes a protein comprising an amino acid sequence selected from the 

group consisting of SEQ ID NO: 10210, SEQ ID NO: 10211, SEQ ID NO: 10212, SEQ ID NO: . 
25 10213, SEQ ID NO: 10214 and SEQ ID NO: 10215. The invention includes a protein having 

sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO: 

10210, SEQ ID NO: 10211, SEQ ID NO: 10212. SEQ ID NO: 10213, SEQ ID NO: 10214 and 

SEQ ID NO: 10215. In one embodiment, the protein does not consist entirely of an amino acid 

sequence of a known SARS virus or of a known coronavirus. 
30 The invention includes a fragment of a protein comprising an amino acid sequence selected 

from the group consisting of SEQ ID NO: 10210, SEQ ID NO: 10211, SEQ ID NO: 10212, SEQ 

ID NO: 10213, SEQ ID NO: 10214 and SEQ ID NO: 10215. In one embodiment, the fragment 

does not consist entirely of an amino acid sequence of a known SARS virus or of a known 

coronaviras. 

35 In one embodiment, the invention includes a polypeptide comprising the amino acid 

sequence selected from the group consisting of SEQ ID NO: 10210, SEQ ID NO: 10211 and 
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SEQ ID NO: 10212. Partial results of a BLAST of SEQ ID NO: 10210 against GenBank is 
included in FIGURE 124. These results indicate that SEQ ID NOS: 10210, 10211 and 10212 
have functional similarities to a Coronavirus RNA polymerase, particularly the RNA polymerase 
of murine hepatitis virus, bovine coronavirus, and avian infectious bronchitis. 

5 In one embodiment, the invention is directed to a polypeptide comprising a first amino acid 

sequence selected from the group consisting of SEQ ID NO: 10210, SEQ ID NO: 1021 1 and 
SEQ ID NO: 10212 and a second amino acid sequence from tiie C-terminus of a coronavirus 
ORFlab sequence. Preferably, the second amino acid sequence is from a bovine coronavirus. 
One example of this embodiment is shown below as SEQ ID NO: 10216. Amino acids 1-481 of 

10 SEQ ID NO: 10216 are the first amino acid sequence of SEQ ID NO: 10210, and amino acids 
482-1152 are the second amino acid sequence of the C-terminus of a bovine coronavirus 
orf lab polyprotein (Gi 26008080) (NP_150073.2) (SEQ ID NO: 10217). 

• Accordingly, the invention includes a polypeptide comprising SEQ ID NO: 10216. The 
invention further includes a polypeptide comprising a first amino acid sequence of SEQ ID NO: 

15 10210 and a second amino acid sequence of SEQ ID NO: 10217. The invention further includes- 
a polypeptide comprising a first amino acid sequence having greater than x% identity to SEQ ID 
NO: 10210 and a second amino acid sequence having greater than y% identity to SEQ ID NO: 
10217, wherein x is greater than or equal to 85% (e.g., 86%, 87%, 88%, 89%, 90%, 91%, 92%, ' 
93%, 94%, 95%, 96%, 97%, 98%, 99% or more) and wherein y is greater than or equal to 60% 

20 (e.g., 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98%, 99% or more). 

The invention also includes a polypeptide comprising a fragment of SEQ ID NO: 10210, 
wherein said fragment includes an epitope. Computer-predicted epitopes of SEQ ID NO: 10210, 
using a 17mer window, are included in FIGURE 125A (Hopp & Woods) and FIGURE 125B 
(Kyte «fe Doolittle). 

25 The amino acid sequence of SEQ ID NO: 10210 also contains two predicted glycosylation 

sites at amino acids 81-84 (NNTE; SEQ ID NO: 10218) and at 180-183 (NHSV; SEQ ID NO: 
10219). Accordingly, the invention includes a polypeptide comprising a fragment of SEQ ID 
NO: 10210, wherein said fragment includes a glycosylation site. The invention further includes 
a polypeptide comprising a fragment of SEQ ID NO: 10210, wherein said fragment includes the 

30 Asn at position 81. Preferably, said Asn is glycosylated. The invention further includes a 

polypeptide comprising a fragment of SEQ ID NO: 10210, wherein said fragment includes the 
Asn at position 180. Preferably, said Asn is glycosylated. 

In one embodiment, the invention includes a polypeptide comprising an amino acid 
sequence from within Figure 120D and/or SEQ ID NO^: 10150 to 10160 e.g. firom SEQ ID NO^: 

35 10154, 10155, 10158 and 10160. Within SEQ ID NO: 10154 the following amino acid sequences 
starting with a Met and ending at a stop codon can be identified: SEQ ID NO®: 10220 to 10227. 
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Accordingly, the invention includes a polypeptide comprising an amino acid sequence 
selected from the group consisting of SEQ ID NO: 10220, SEQ ID NO: 10221, SEQ ID NO: 
10222, SEQ ID NO: 10223, SEQ ID NO: 10224, SEQ ID NO: 10225, SEQ JD NO: 10226 and 
SEQ ID NO: 10227, or a fragment thereof or an amino acid sequence having sequence identity 
5 thereto. 

In one embodiment, the invention includes a polypeptide comprising the amino acid 
sequence within Figure 120E e.g. from SEQ ID NO^: 10161 to 10182, and in particular SEQ ID 
NOS: 10171 and 10176. Within SEQ ID NO^: 10171 and 10176 the foUowing amino acid 
sequences starting with a Met and ending at a stop codon can be identified: SEQ ID NO: 10228 
10 and SEQ ID NO: 10229. 

Accordingly, the invention includes a polypeptide comprising an amino acid sequence 
selected from the group consisting of SEQ ID NO: 10228 and SEQ ID NO: 10229, or a fragment 
thereof or an amino acid sequence having sequence identity thereto. 

In one embodiment, the invention includes a polypeptide comprising an amino acid 
15 sequence from Figure 120F e.g. SEQ ID NO^: 10183 to 10209. Within Figure 120F the 

foUowing amino acid sequence starting with a Met and ending at a stop codon can be identified- 
SEQ ID NO: 10187. Accordingly, the invention includes a polypeptide comprising an amino 
acid sequence of SEQ ID NO: 10187, or a fragment thereof or an amino acid sequence having 
sequence identity thereto. 
20 In one embodiment, the polynucleotides of the invention do not include one of the 

following primers, disclosed at http://coment.nejm.org/cgi/reprint/NEJMoa030781v2:pdf: 

5 • GGGTTGGGACTATCCTAAGTGTGA3 ' (SEQ ID NO: 10230) 

5 ' TAACACACAACICCATCATCA3 ■ (SEQ ID NO: 10231) 

5 • CTAACATGCTTAGGATAATGG3 ' ( SEQ ID NO : 1 0 2 3 2 ) 

25 5 ' GCCTCTCTTGTTCTTGCTCGC3 • (SEQ ID NO: 10233) 

5 • CAGGTAAGCGTAAAACTCATC3 • (SEQ ID NO: 10234) 

The invention also includes polynucleotide sequences which can be used as probes for 
diagnostic reagents, kits (comprising such reagents) and methods which can be used to diagnose 

30 or identify the presence or absence of a SARS virus in a biological sample. The invention 

includes the polynucleotide primers identified in Table 31 (SEQ ID NO^: 10235 to 10258), the 
forward primers SEQ ID NO^: 10259 to 10281 and the reverse primers SEQ ID NO^: 10282 to 
10298. The invention further includes polynucleotide sequences which are complementary to 
any one of these primer sequences disclosed herein. 

35 The invention provides a SARS polynucleotide sequence SEQ ID NO: 10299. AU six 

reading frames of this sequence are included in FIGURE 126 (See also Figure 131). The 
constituent amino acid sequences from Figure 126, having at least 4 amino acids, are listed as 
SEQ ID NOS: 10300 to 10337. 
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Accordingly, the invention includes a polynucleotide sequence comprising SEQ ID NO: 
10299. It also provides polynucleotide sequences having sequence identity to SEQ ID NO: 
10299. The invention also provides for polynucleotide sequences comprising fragments of SEQ 
ID NO: 10299. La one embodiment, the polynucleotide fi-agment does not consist entirely of a 
5 known polynucleotide sequence of a" S ARS virus or a known polynucleotide sequence of a 
coronavirus. 

The invention includes an amino acid sequence encoded by the polynucleotide sequence of 
SEQ ID NO: 10299, including the amino acid sequences shown in Figure 126, and the amino 
acid sequences selected from the group consisting of SEQ ID NO^: 10300 to 10337. Preferably, 
10 the amino acid sequence comprises SEQ ID NO: 10316. 

The invention also provides amino acid sequences having sequence identity to an amino 
acid sequence encoded by SEQ ID NO: 10299. The invention provides amino acid sequences 
having identity to an amino acid sequence selected from the group consisting of SEQ ID NO^: 
10300 to 10337. 

15 ' The invention also provides fragments of amino acid sequences encoded by SEQ ID NO: 
10299. The invention also provides fragments of amino acid sequences selected froni the group 
consisting of SEQ ID NO^: 10300 to 10337. In one embodiment, the fragment does not consist 
entirely of a known amino acid sequence of a S ARS virus or a known amino acid sequence of a 
coronavirus. , 
20 In one embodiment, the invention comprises an amino acid sequence comprising SEQ ID ' 

NO: 10316. Encoded open reading frames within SEQ ID NO: 10316 include SEQ ID NO: 
10338 and SEQ ID NO: 10339. 

In one embodiment, the invention comprises an amino acid sequence comprising a 
sequence from within the 5'3' Frame 1 translation of SEQ ID NO: 10299. The following encoded 
25 open reading freime is found within this translation: SEQ ID NO: 10340. 

In one embodiment, the invention comprises an amino acid sequence comprising a 
sequence from within the 3'5' Frame 1 translation of SEQ ID NO: 10299. An encoded open 
reading frame within this translation is SEQ ID NO: 10341. 

In one embodiment, the invention comprises an amino acid sequence comprising a 
30 sequence from within the 3'5' Frame 2 translation of SEQ ID NO: 10299. An encoded open 
reading frame within this translation is SEQ ID NO: 10342. 

The invention includes a polypeptide comprising an amino acid sequence selected from the 
group consisting of SEQ ID NO: 10338, SEQ ID NO: 10339, SEQ ID NO: 10340, SEQ ID NO: 
10341 and SEQ ID NO: 10342. The invention includes a polypeptide having sequence identity 
35 to an amino acid sequence selected from the group consisting of SEQ ID NO: 10338, SEQ ID 
NO: 10339, SEQ ID NO: 10340, SEQ ID NO: 10341 and SEQ ID NO: 10342. The invention 
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includes a fragment of a polypeptide comprising an amino acid sequence elected from the group 
consisting of SBQ ID NO: 10338, SEQ ID NO: 10339, SEQ ID NO: 10340, SEQ ID NO- 10341 
»d SEQn>NO: 10342. m one embodiment the fragment does not consist entirely of a known 
SARS vmis amino add sequence or of a known coronavirus amino acid sequence 

5 I»<'»^»°"'odime„t.SEQlDNOS:10338-10342areusedinfi.sionpro,eins. Accordingly 

thestartcodonmefluoninesmayberemoved. The invention comprises a amino acid sequence ' 
selected from the group consisUng of SEQ ID NO: 10343, SEQ ID NO: 10344 SEOIDNO- 
10345, SEQ ID NO: 10346 and SEQ ID NO: 10347. ' 

In one embodiment, the invention comprises an amino acid sequence selected from the 

r3,Tr,?'°"^*^'°'''°^ 10338andSEQIDNO: 10339. Partial BLAST results of SEQ 
ID NO: 10338 against GenBank are given below • 

<^--^l^-s±bX. .Utro^t.ritir^!^:\^^^'''^ 2.7.7.48) - porcine 

Length = 533 
Score = .131 bits (329), Expect = 3e-30 

Identxties = 55/89 (61%), Positives = 69/89 (77%), Gaps = 1/89 (1%) 
Quer.: 1 . SJf--™ 

Sb,ct: 217 -WCE.SHlK™Ui^^^^^^^^ . 

Query: 61 VAKYTQLCQYLNTLTLAVPSNMRVIHFGA 89 
CK- . o^. ^ KYTQLCQYLNT TL VP MRV+H GA 
SbDct: 276 WKYTQLCQYLNTTTLCVPHKMRVLHLGA 304 

These results indicate that SEQ ID NO: 10338 has functional sinularities to an RNA- 
directed RNA polymerase of porcine transmissible gastroenteritis virus 

Partial BLAST results of SEQ ID NO: 10339 against GenBank are given below 

>gbjAAI,57305.l| replicase [bovine coronavirus] 
Length = 7094 

•Score = 139 bits (351) . Expect = 7e-33 

Identities = 64/108 (59%), Positives = 78/108 (72%) 

' so 

S..ct: 6760 -CVSKV_.kOPqT^,,^^^ 
Query: 61 ^^^Q^GENAVIPKGim^AKYTQLCQYLI^^^^ 

Sb,ct: 6820 VS^^^KPV^ip^S™^^^ ,3,, 

TheseresultsindicatethatSEQIDNO:10339hasfunctionalsimiiaritiestoareplicaseof 
bovme coronavirus. ^ 
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The SARS virus may contain polymorphism at the Glu-20 residue of SEQ ID NO: 10338. 
The invention includes a polypeptide comprising an amino acid sequence having sequence 
identity to SEQ ID NO: 10338, wherein said polypeptide includes an amino acid sequence 
selected from the group consisting of ASQAW (SEQ ID NO: 10348) and ASRAW (SEQ ID NO: 
5 10349). The invention includes a fragment of a polypeptide comprising SEQ ID NO: 10338, 
wherein said fragment includes an amino acid sequence selected from the group consisting of 
SEQ ID NO: 10348 and SEQ ID NO: 10349. 

The SARS virus may contain polymorphism at the Ser-80 residue of SEQ ID NO: 10338. 
below. The invention includes a polypeptide comprising an amino acid sequence having 

10 sequence identity to SEQ ID NO: 10338, wherein said polypeptide includes an amino acid 

sequence selected from the group consisting of VPSNM (SEQ ID NO: 10350) and VPTNM (SEQ 
ID NO: 10351). The invention includes a fragment of a polypeptide comprising SEQ ID NO: 
10338, wherein said fragment includes an amino acid sequence selected from the group 
consisting of SEQ ID NO: 10350 and SEQ ID NO: 10351. 

15 The invention also includes polynucleotide sequences which can be used as probes for 

diagnostic reagents, kits (comprising such reagents) and methods which can be used to diagnose 
or identify the presence or absence of a SARS virus in a biological sample. The invention 
includes a polynucleotide sequence comprising one or more of the primer sequences identified in 
Table 32. The invention further includes polynucleotide sequence comprising the complement of 

20 one or more of the primer sequences identified in Table 32. 

The invention provides a SARS polynucleotide sequence SEQ ID NO: 10505. All six 
reading frames of this sequence are shown in Figure 127 (see also Figure 132). The constituent 
amino acid sequences from Figure 127, having at least 4 amino acids, are listed as SEQ ID NOS: .. 
10506 to 10570. 

25 The invention includes a polynucleotide sequence comprising SEQ ID NO: 10505. The 

invention also provides polynucleotide sequences having sequence identity to SEQ ID NO: 
10505. The invention also provides for polynucleotide sequences comprising fragments of SEQ 
ID NO: 10505. In one embodiment, the polynucleotide fragment does not consist entirely of a 
known SARS virus polynucleotide sequence or of a known coronavirus polynucleotide 

30 sequence. 

The invention includes an amino acid sequence encoded by the polynucleotide sequence of 
SEQ ID NO: 10505, including the amino acid sequences shown in Figure 127, and particularly 
those selected from tiie group consisting of SEQ ID NO^: 10506 to 10570. Preferably, the amino 
acid sequence comprises SEQ ID NO: 10532 and/or SEQ ED NO: 10533. 
35 The invention also provides amino acid sequences having sequence identity to an amino 

acid sequence encoded by SEQ ID NO: 10505. The invention provides amino acid sequences 
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As used herein, the term "target nucleic acid region" or "target nucleic acid" denotes a 
nucleic acid molecule with a "target sequence" to be amplified. The target nucleic acid may be 
either smgle-stranded or double-stranded and may include other sequences besides the target 
sequence, which may not be amplified. The term "target sequence" refers to the particular 
nucleotide sequence of the target nucleic acid which is to be amplified. The target sequence may 
mclude a probe-hybridizing region contained within the target molecule with which a probe will 
form a stable hybrid under desired conditions. The "target sequence" may also include the 
complexing sequences to which the oligonucleotide primers complex and be extended using tiie 
target sequence as a template. Where the target nucleic acid is originally single-stranded the 
term "target sequence" also refers to the sequence complementary to the "target sequence" as 
presentinthetargetnucleicacid. If the "target nucleic acid" is originally double-stranded the 
term "target sequence" refers to both the plus (+) and minus (-) strands. 

The term "primer" or "oligonucleotide primer" as used herein, refers to an oUgonucleotide 
which acts to initiate synthesis of a complementary DNA strand when placed under conditions in • 
which synthesis of a primer extension product is induced i.e. in the presence of nucleotides and a • 
polymenzation-inducing agent such as a DNA or RNA polymerase and at suitable temperature • : 

pH, metal concentration, and salt concentration. The primer is preferably single-stranded for ' 
maximum efficiency in amplification, but may alternatively be double-stranded. If double- 
stranded, the primer is fir.t treated to separate its strands before being used to prepare extension 
products. This denaturation step is typically effected by heat, but may alternatively be carried 
outusingalkali,followedbyneutralization. Thus, a "primer" is complementary to a template • 
and complexes by hydrogen bonding or hybridization with the template to give a primer/tempiate 
complex for initiation of synthesis by a polymerase, which is extended by the addition of 
covalently bonded bases linked at its 3' end complementary to the template in the process of 
DNA synthesis. 

As used herein, the teim "probe" or "oUgonucleotide probe" refers to a structure compdsed 
of a polynucleotide, as defined above, that contains a nucleic acid sequence complementary to a 
nuclerc acid sequence present in the target nucleic acid analyte. The polynucleotide regions of 
probes may be composed of DNA. and/or RNA. and/or synthetic nucleotide analogs. When an 
"ohgonucleoUde probe" is to be used in a 5' nuclease assay, such as the TaqManTM technique the 
probe wm contain at least one ftoorescer and at least one quencher which is digested by the 5' 
endonuclease acUvity of a polymerase used in fl,e reaction in order to detect any ampUfied target 
oligonucleotide sequences. L. this context, the oligonucleotide probe will have a sufficient 
number of phosphodiester linkages adjacent to its 5' end so that the 5' to 3' nuclease activity 
employed can efficiently degrade the bound probe to separate the fluorescers and quenchers 
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When an oligonucleotide probe is used in the TMA technique, it vdll be suitably labeled, as 
described below. 

It will be appreciated that the hybridizing sequences need not have perfect 
complementarity to provide stable hybrids. In many situations, stable hybrids will form where 
5 fewer than about 10% of the bases are mismatches, ignoring loops of four or more nucleotides. 
Accordingly, as used herein the term "complementary" refers to an oligonucleotide that forms a 
stable duplex with its "complement" under assay conditions, generally where there is about 90% 
or greater homology. 

The terms "hybridize" and "hybridization" refer to the formation of complexes between 

10 nucleotide sequences which are sufficiently complementary to form complexes via Watson-Crick 
base pairing. Where a primer "hybridizes" with target (template), such complexes (or hybrids) 
are sufficiently stable to serve the priming function required by e.g. the DNA polymerase to 
initiate DNA synthesis. 

Stringent hybridization conditions will typically include salt concentrations of less than 

15 about 1 M, more usually less than about 500 mM and preferably less than about 200 mM. 
Hybridization temperatures can be as low as 5*'C, but are typically greater than 22**C^ more 
typically greater than about SO^C, and preferably in excess of about 3T*C. Longer fragments 
may require higher hybridization temperatures for specific hybridization. Other factors may 
affect the stringency of hybridization, including base composition and length of the 

20 complementary strands, presence of organic solvents and extent of base imsmatching, and the 
combination of parameters used is more important than the absolute measure of any one alone. 
Other hybridization conditions which may be controlled include buffer type and concentration, 
solution pH, presence and concentration of blocking reagents to decrease background binding 
such as repeat sequences or blocking protein solutions, detergent type(s) and concentrations, 

15 molecules such as polymers which increase the relative concentration of the polynucleotides, 
metal ion(s) and their concentration(s), chelator(s) and their concentrations, and other conditions 
known in the art. Less stringent, and/or more physiological, hybridization conditions are used 
where a labeled polynucleotide amplification product cycles on and off a isubstrate linked to a 
complementary probe polynucleotide during a real-time assay which is monitored during PGR 

JO amplification such as a molecular beacon assay. Such less stringent hybridization conditions can 
also comprise solution conditions effective for other aspects of the method, for example reverse 
transcription or PGR. 

As used herein, a "biological sample" refers to a sample of tissue, cells or fluid isolated 
firom a subject, that conmionly includes antibodies produced by the subject. Typical samples 
15 include but are not limited to, blood, plasma, serum, fecal matter, urine, bone marrow, bile, 
spinal fluid, lymph fluid, samples of the skin, secretions of the skin, respiratory, intestinal, and 
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genitourinary tracts, tears. saUva, sputum, mucous, milk, blood cells, organs, tissues, biopsies 
(e.g. lung, liver, kidney) and also samples of in vitro ceU culture constituents including but not 
limited to conditioned media resulting from the growth of cells and tissues in culture medium 
e.g. recombinant cells, and cell components. Other samples that may be used for diagnosis 
include stool samples and nasopharyngeal aspirates. 

The term "antibody" encompasses polyclonal and monoclonal antibody preparations, as 
well as preparations including hybrid antibodies, altered antibodies, chimeric antibodies arid, 
humanized antibodies, as well as: hybrid (chimeric) antibody molecules (see, for example 
Winter et al (1991) Nature 349:293-299; and US Patent 4.816.567); F(ab')2 and F(ab) 
fragments; Fv molecules (noncovalent heterodimers, see, for example. Inbar et al (1972) Proc 
Natl Acad Sci USA 69:2659-2662; and Ehrlich et al (1980) Biocltem 19:4091-4096); 
single-chain Fv molecules (sFv) {see, e.g., Huston et al (1988) Proc Natl Acad Sci USA 
85:5879-5883); oligobodies; dimeric and trimeric antibody fragment constructs; minibodies (see 
e.8. . Pack et al (1992) Biochem 31:1579-1584;X:umber et al (1992) J Immunology 
149B:120-126); humanized antibody molecules (see, e.g, Riechmann et al (1988) Nature 
332:323-327; Verhoeyan et al (1988) Science 239:1534-1536; and UK Patent Publication No 
GB 2.276,169, published 21 September 1994); and, any functional fragments obtained from such 
molecules, wherein such fragments retain specific-binding properties of the parent antibody 
molecule. 

As used herein, the term "monoclonal antibody" refers to an antibody composition having 
a homogeneous antibody population. The term is not limited regarding the species or source of • = 
the antibody, nor is it intended to be limited by the manner in which it is made. The term 
encompasses whole immunoglobulins. 

Methods of making polyclonal and monoclonal antibodies are known in the art Polyclonal 
antibodies are generated by immunizing a suitable animal, such as a mouse, rat. rabbit, sheep or • 
goat, with an antigen of interest. In order to enhance immunogenicity, the antigen can be Unked 
to a earner prior to immunization. Suitable carriers are typically large, slowly metabolized 
macromolecules such as proteins, polysaccharides, polylactic acids, polyglycolic acids, 
polymeric amino acids, amino acid copolymers. Upid aggregates (such as oil droplets or 
liposomes), and inactive virus particles. Such carriers are well known to those of ordinary skill in 
the art. Furthermore, the antigen may be conjugated to a bacterial toxoid, such as toxoid from 
diphtheria, tetanus, cholera, etc., in order to enhance the immunogenicity thereof. 

Rabbits, sheep and goats are preferred for the preparation of polyclonal sera when large 
volumes of sera are desired. These animals are good design choices also because of the 
availability of labeled anti-rabbit, anti-sheep and anti-goat antibodies. Immunization is generally 
performed by mixing or emulsifying the antigen in saline, preferably in an adjuvant such as 
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Freund's complete adjuvant ("FCA"), and injecting the mixture or emulsion parenterally 
(generaUy subcutancously or intramuscularly). The animal is generally boosted 2-6 weeks later 
with one or more injections of the antigen in saline, preferably using Freund's incomplete 
adjuvant ("FIA"). Antibodies may also be generated by in vitro inununization, using methods 
known in the art. Polyclonal antisera is tiien obtained from the immunized animal. 

Monoclonal antibodies are generally prepared using the method of Kohler & Milstein 
(1975) Nature 256:495-497, or a modification thereof, as described above. 
Nucleic acid detection methods 

There are many well known methods of ampUfying targeted sequences, such as the 
polymerase chain reaction (PGR), reverse transcription PGR (RT-PGR), the ligase chain reaction 
(LCR), the strand displacement amplification (St) A), and the nucleic acid sequence-based 
amplification (NASBA), transcription-mediated amplification (TMA) to name a few. These 
methods are described generally in the following references: (PGR) US Patents 4,683,195, 
4,683,202. and 4,800.159; (RT-PCR) US patent 5,310,652, 5,322,770; (LGR) EP Application 
No., 320,308 pubUshed Jun. 14. 1989; (SDA) US Pat. Nos. 5,270,184, and 5,455,166 and 
"Empirical Aspects of Strand Displacement Amplification" by G. T. Walker in PCR Methods 
and Applications, 3(l):l-6 (1993), Cold Spring Harbor Laboratory Press; (TMA) US Patent No. 
5,399,491, and (NASBA) "Nucleic Acid Sequence-Based Amplification (NASBAtm)" by L. 
Malek et al, Ch. 36 in Methods in Molecular Biology, Vol. 28: Protocols for Nucleic Acid 
Analysis by Nonradioactive Probes, 1994 Ed. P. G. Isaac, Humana Press, Inc., Totowa, N.J. PGR 
methods may include variations that permit quantitation of the target sequence, for example, by 
real time PGR analysis (e.g., as described in US patents 5,210,015, 5,487,972, 5,994,056, 
6,171,785 inter alia). (Each of the above references are hereby incorporated by reference). 

One embodiment of the method of the invention for detecting the presence of S ARS virus 
in a sample comprises providing a sample suspected of containing a S ARS virus nucleic acid 
target, ampUfying a template sequence contained within said SARS virus nucleic acid target by 
any known technique of nucleic acid ampUfication, including any of those mentioned herein, 
using the oUgonucleotide primers described herein, particularly those primers comprising the. kits 
described herein, and detecting the amplified template sequence, wherein the presence of tiie 
ampUfied template sequence indicates the presence of SARS virus in said sample. 

Amplification techniques generally involve the use of two primers. Where a target 
sequence is single-sti-anded, the techniques generally involve a preliminary step in which a 
complementary sti-and is made in order to give a double-stiranded target. The two primers 
hybridize to different stirands of the double-stranded target and are then extended. The extended 
products can serve as targets for further rounds of hybridization/extension. The net effect is to 
ampUfy a template sequence within the target, the 5' and 3* termini of the template being defined 
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by a,e locaUons of the two primers in the target As an alternative, if one or both of the primers 
contams a promoter sequence then the target oan be amplified (by transcription) using a RNA 
polymerase (as in TMA). 

The present mvention provides methods and Idts for amplifying and/or detecting a template 

> or target sequence in the SARS V viral nucleic acid. The invention provides a kit comprising 
pnmers for amplifying a template sequence contained within a SARSV nucleic acid target the 
tat composing a first primer and a second primer, wherem the first primer comprises a sequence 
substantially complementary to aportion of said template sequence and the second primer 
composes a sequence substantially complementary to a portion of the complement of said 

> template sequence, wherein the sequences within said primers which have substantial 
complementarity define the termini of tiie template sequence to be amplified. 

Kits of the invention may further comprise a probe which is substantially complementary 
to the template sequence and/or to its complement and wWch can hybridize thereto. This probe 
can be used in a hybridization techmque to detect amplified template, or to isolate (.-... "capture) 
the amplified template or Uie original target nucleic acid. 

Kits of the invention may fimher comprise primers and/or probes for generating and 

detectinganintemalstandard,ino»dertoaidquantitativemeasmements(«.«Fille«a/ 1997 

Biotechniques 23:34-36). 

Kits of the mvention may ftoher comprise a DNA polymerase, which wiU generally be a 
themiostable DNA polymerase where a non.isoU,ermal ampUfication process is to be used The 

tats may also comprise supplies of dNTPs.amagnesium salt (..^.MgCW, buffer solutions. «c 

Kits of the invention may comprise more Uian one pair of primers (e.g. for nested 
amphfication), and one primer may be common to more than one primer pair. The tat may also 
comprise more than one probe. 
Oligome r Probes and Primers 

h connection with the nucleic acid detection methods of Uie piesent invention described 
above, ohgomers having sequence similarity, or complementarity, to die SARSV genome are 
usefiil. The SARSV genome sequences mentioned herein may be used to produce probes and 
immets which can be used in assays for fte detection of nucleic acids in test samples. THe probes 
may be designed fix,m conserved nucleotide regions of the polynucleotides of interest or ftom 
nouKxinserved nucleotide regions of the polynucleotide of inteiest. The design of such probes for 
optimization in assays is within the skiU of tiiose of ordinary skiU in the art. GeneraUy. nucleic 
acid probes are developed from non^onserved or unique regions when maximum specificity is 
desired, and nucleic acid probes are developed fix,m conserved regions when assaying for 
nucleotide regions diat are closely related to. for example, different membe.^ of a multi-gene 
famdy or m related species like mouse and man. 
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Using as a basis the SARSV genome which can be found as described herein, and/or 
preferably conserved regions of the SARSV genome, and/or the particularly described primer 
and probe sequences as disclosed herein, oligomers of approximately 8 nucleotides or more can 
be prepared which hybridize with the positive strand(s) of SARSV RNA or its complement, as 
well as to SARSV cDNAs. These oligomers can serve as probes for the detection (including 
isolation and/or labeling) of polynucleotides which contain SARSV nucleotide sequences, and/or 
as primers for the transcription and/or replication of targeted SARSV sequences. The oligomers 
contain a targeting polynucleotide sequence, which is comprised of nucleotides which are 
complementary to a target SARSV nucleotide sequence; the sequence is of sufficient length and 
complementarity with the SARSV sequence to form a duplex which has sufficient stability for 
the purpose intended. For example, if the purpose is the isolation, via immobilization, of an 
analyte containing a target SARSV sequence, the oligomers would contain a polynucleotide 
region which is of sufficient length and complementarity to the targeted SARSV sequence to 
afford sufficient duplex stability to inmiobilize the analyte on a solid surface, via its binding to • 
the oligomers, under the isolation conditions. For example, also, if the oligomers are to serve as 
primers for the transcription and/or replication of target SARSV sequences in an analyte 
polynucleotide, the oligomers would contain a polynucleotide region of sufficient length and 
complementarity to the targeted SARSV sequence to allow the polymerizing agent to continue 
replication from the primers which are in stable duplex form with the target sequence, under the 
polymerizing conditions. For example, also, if the oligomers are to be used as label probes, or 
are to bind to multimers, the targeting polynucleotide region would be of sufficient length and 
complementarity to form stable hybrid duplex structures with the label probes and/or multimers 
to allow detection of the duplex. The oligomers may contain a minimum of about 4 contiguous 
nucleotides which are complementary to targeted SARSV sequence; usually the oligomers will 
contain a minimum of about 8 contiguous nucleotides which are complementary to the targeted 
SARSV sequence, and preferably will contain a minimum of about 14, 15, 16, 17, 18, 19, 20,21, 
22, 23, 24, 25, 26, 27, 28, 29, or 30 contiguous nucleotides and up to about 50, 75, 100, 200 
contiguous nucleotides or more, which are complementary to the targeted SARSV sequence. 

Typically, for use in the amplification based metiiods (for example, PGR, RT-PCR, TMA) 
oligomers will be used as primer sets such that one member of the primer set has sequence 
similarity or complementarity to a more conserved (among coronaviruses) portion of the SARSV 
genome and the other member of the primer set has sequence similarity or complementarity to a 
less conserved portion. The primer sets can be used to amplify the target region in ways that are 
well known in the art. Typically, the 5' untranslated region (5'UTR) and the 3' unti-anslated 
region (3'UTR) are among the most conserved regions. Figure 8 shows an alignment of the 
5'UTR of several coronaviruses. Figure 10 shows an alignment of the 3'UTR of several 
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coronaviruses. Figures 9 and 11 show the sequences of preferred primers for amplification of the 
5TJTR and 3TJTR, respectively. Other primers and probes can readily be designed based on the 
sequence alignments provided herein. 

The oligomer, however, need not consist only of the sequence which is complementary to 
the targeted SARSV sequence. It may contain in addition, nucleotide sequences (e.^.promoters) 
or other moieties which are suitable for the purposes for which the oligomers are used. For 
example, if the oligomers are used as primers for the amplification of SARSV sequences yia, for 
example, PGR, they may contain sequences which, when in duplex, form restriction enzyme sites 
which faciUtate the cloning of the amplified sequences. For example, also, if the oligomers are to 
be used as "capture probes" in hybridization assays, they would contain in addition a binding 
partner which is coupled to the oligomer containing the nucleotide sequence which is 
complementary to the targeted SARSV sequence. Other types of moieities or sequences which 
are useful of which the oligomers may be comprised or coupled to. are those which are known in 
the art to be suitable for a variety of purposes, including the labeling of nucleotide probes. 

Table 4 (SEQ ID NOS: 1021-6020) shows forward and reverse primers that are useful for 
nucleic acid amplification of SARSV for diagnostic and screening methods. 

Preferred primers and probes for SARS nucleic acid detection for diagnostic and screening 
are SEQ ID NOS: 7332-7336 (forward primers). SEQ ID NOS: 7337-7341 (reverse primers) and 
SEQ ID NOS: 7342-7352 (probes). These primers and probes are useful for detection of 
sequences in the 3' UTR. 

Any of the above forward primers may be used in combination with any of the above 
reverse primers for amplification of SARSV nucleic acid. The amplified product may be 
detected (or captured) with any of the abovfc probes. Particularly preferred combinations of 
forward and reverse primers and the probes for detecting the amplified product include: Forward 
SEQ ID NO: 7332 with reverse SEQ ID NO: 7337, 7338, 7339 or 7341 and probe SEQ ID NO: 
7342; forward SEQ ID NO: 7333 or 7334 with reverse SEQ ID NO: 7340 and any of probes 
SEQ ID NO: 7343-7351; Forward SEQ ID NO: 7335 and reverse SEQ ID NO: 7340 or 7341 and 
any of probes SEQ ID NO: 7342-7352. Other combinations of forward and reverse primers and 
appropriate probes can readily be determined by those skilled in the art from the above 
information. 

Additional preferred primers and probes for SARS nucleic acid detection for diagnostic 
and screening are SEQ ID NOS: 7353-7362 (forward primers), SEQ ID NOS: 7363-7373 
(reverse primers) and SEQ ID NOS: 7374-7385 (probes). The primers and probes are useful for 
detection of sequences in the 5' UTR. 

The above primers may be used in combination for amplification of SARSV nucleic acid 
as follows: any of forward primers SEQ ID NO: 7353-7356 with any of reverse primers SEQ ID 
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NO: 7363-7366, 7368 and the amplified product detected (or captured) \yith probes SEQ ID NO: 
7374; any of forward primers SEQ ID NO: 7357-7362 with any of reverse primers SEQ ID NO: 
7367, 7369-7373 and the amplified products detected (or captured) with any of probes SEQ ID 
NO: 7375-7385. Particularly preferred combinations of forward and reverse primers and probes 
are: Forward primers SEQ ID NO: 7353-7356 with any of reverse primers SEQ ID NO: 7363- 
7366 and probes SEQ ID NO: 7374; forward primers SEQ ID NO: 7357-7358 with reverse 
primers SEQ ID NO: 7367. 7369 and probes SEQ ID NO: 7375 or 7376; Forward primers.SEQ 
ID NO: 7357-7359 with reverse primers SEQ ID NO: 7367, 7369 or 7370 and probe SEQ ID 
NO: 7375 or 7376. More preferred are combinations of SEQ ID NO: 7353 or 7354 with SEQ ID 
NO: 7363 or 7364 and probe SEQ ID NO: 7374. Other combinations of forward and reverse 
primers and appropriate probes can readily be determined by those skiUed in the art firom the 
above information. A particularly conserved octanucleotide sequence (SEQ ID NO: 7386) occurs 
in the 3TJTR of SARS (approximately 70-80 bases from the 3" end) and of several other 
Coronaviruses that may be particularly useful in identifying SARS V. Primers including in this 
region are preferably combined with reverse primers from regions of sequence that are more 
specific for SARS. 

In addition to the above, the intergenic sequence QS) that is characteristic of Coronavirus 
has been identified in SARSV (see above). The IS minimaUy comprises the sequence ACGAAC 
(SEQ ID NO: 7293) which occurs upstream of each open reading frame (ORF) in the viral 
genome. The 5UTR which includes the IS is spliced onto the 5' end of each viral mRNA at or 
adjacent to the site of the IS. Thus, primers comprising the IS or its complement are useful for 
amplifying viral nucleic acids, including cDNA made from the viral RNAs. The invention thus 
comprises a set of primers in which one primer comprises ACGAAC (SEQ ID NO: 7293) or its 
complement (SEQ ID NO: 7387) and one primer comprises any appropriate sequence from the 
SARS genome, or a complementary sequence. Useful probes for detecting and/or capturing the 
viral RNAs or cDNA naade from the viral RNAs may also comprise the IS sequence, or its 
complement, described above. 

One set of primers for amplification of SARS sequences, particularly by RT-PCR, uses 
SEQ ID NOs 6562, 6563, 6564 and 6565. Of these, 6562 & 6564 are sense primers and 6563 & 
6565 are antisense primers. Primers SEQ ID NOS: 6562 & 6565 may be used in a first 
amplification, with a second nested amplification being performed using primers SEQ ID NOS: 
6563 & 6564. In some embodiments of the invention, these four primers are excluded. 

One kit for amplification and detection of SARS sequences, particularly by RT-PCR, uses 
SEQ ID NOs 6567 & 6568 as primers, and SEQ ID NO 6566 as a probe (typically labeled e.g. 
with TAMRA and/or FAM) for the amplified sequence. In some embodiments of the invention, 
these primers and probe are excluded. 
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One kit for amplification and detection of SARS sequences, particularly by RT-PCR, uses 
SEQ ID NOs 7395 & 6568 as primers, and SEQ ID NO 6566 as a probe (typically labeled e.g, 
with TAMRA and/or FAM) for the amplified sequence. In some embodiments of tiie invention, 
these primers and probe are excluded. 

One Mt for amplification of SARS sequences, particularly the nucleocapsid gene, uses 
SEQ ID NOs 6560 & 6561 as primers. In some embodiments of the invention, tiiese primers are 
excluded. 

One kit for amplification of SARS sequences uses SEQ ID NOs 6496, 6497, 6562, 6563, 
6564 & 6565 as primers. In some embodiments of tiie invention, these primers are excluded. 

One kit for amplification of SARS sequences uses SEQ ID NOs 6562, 6563, 6564 & 6565 
as primers. In some embodiments of the invention, these primers are excluded. 

One kit for amplification of SARS sequences uses SEQ ID NOs 6500, 6501, 6502 & 6503 
as primers. In some embodiments of the invention, these primers are excluded. 

One kit for amplification of SARS sequences uses SEQ ID NOs 6496, 6497, 6500, 6501, 
6502, 6503, 6562, 6563, 6564 & 6565 as primers. In some embodiments of tiie invention, tiiese 
primers are excluded. 

One kit for amplification and detection of SARS sequences, particularly by realtime {e.g. 
TaqManTM) PCR, uses SEQ ID NOs 6567 & 6568 as primers, and SEQ ID NO 6566 as a probe 
(typically labeled e.g. witii TAMRA and/or FAM) for tiie amplified sequence. In some • 
embodiments of the invention, these primers and probe are excluded. 

One kit for amplification and detection of SARS sequences, particularly by realtime (e.g. ■ ■ 
TaqMan-^) PCR, uses SEQ ID NOs 7395 & 6568 as primers, and SEQ ID NO 6566 as a probe ' 
(typically labeled e.g. with TAMRA and/or FAM) for tiie amplified sequence. In some 
embodiments of tiie invention, these primers and probe are excluded. 

One kit for amplification and detection of SARS sequences uses SEQ ID NOs 6562, 6565 
and 6568 as primers, and SEQ ID NOs 7396 and 7397 as probes (typically labeled e.g. witii 
TAMRA and/or FAM) for tiie amplified sequence. In some embodiments of tiie invention, tiiese 
primers and probe are excluded. 

One kit for amplification and detection of SARS sequences uses an oligonucleotide 
comprising SEQ ID NO: 9780 as a forward primer, an oligonucleotide comprising SEQ ID NO: 
9781 as a reverse primer, and an oligonucleotide comprising SEQ ID NO: 9782 as a probe. 

Preferred sequences for use with RT-PCR and LightCycler analysis include SEQ ID NOs 
6562. 6568. 6565, 7396 & 7397. In some embodiments of tiie invention, tiiese primers and probe 
are excluded. 

The preparation of the oligomers is by means known in the art, including, for example, by 
metiiods which include excision, transcription, or chemical syntiiesis. The target sequences 
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and/or regions of the genome which are selected to which the targeting polynucleotides of the 
oUgomers are complementary depend upon the purpose. For example, if the goal is to screen for 
the presence of SARSV in biological samples (e.g. blood, respiratory material, liver, lung), the 
preferred oligomers would be used as probes and/or primers, and would hybridize to conserved 
regions of the SARSV genome. Some of the conserved regions of the SARSV genome to which 
the oligomers may bind are described herein, for example, 5'UTR and SUTR. 

In the basic nucleic acid hybridization assay, single-stranded analyte nucleic acid (either 
DNA or RNA) is hybridized to a nucleic acid probe, and resulting duplexes are detected. The 
probes for SARSV polynucleotides (natural or derived) are a length which aUows the detection 
of unique viral sequences by hybridization. While 6-8 nucleotides may be a workable length, 
sequences of 10-12 nucleotides are preferred, and abouflS, 14, 15, 16, 17, 18, 19, 20, or 21 or 
more nucleotides or more appears optimal. Preferably, these sequences will derive from regions 
which lack heterogeneity. These probes can be prepared using routine methods, including 
automated oligonucleotide synthetic methods. Among useful probes, for example, are those 
derived from less conserved regions of the SARSV genome. Regions of the genome that are • 
typically less conserved can be readily ascertained from the sequence aKgnments provided 
herein, as weU as by any other well known techniques. A complement to any unique portion of . 
the SARSV genome will be satisfactory. For use as probes, complete complementarity is 
desirable, though it may be unnecessary as the length of the fragment is increased. • 

For use of such probes as agents to detect the presence of SARSV polynucleotides (for 
example in screening for contaminated blood or for diagnosing infected individuals), the 
biological sample to be analyzed, such as, without limitation, blood, serum, lung, liver • mucous, • 
kidney, saUva, or sputum, may be treated, if desired, to extract the nucleic acids contained 
therein. The resulting nucleic acid from the sample may be subjected to gel electrophoresis or 
other size separation techniques; alternatively, the nucleic acid sample may be dot blotted 
without size separation. In order to form hybrid duplexes with the targeting sequence of the 
probe, the targeted region of the analyte nucleic acid must be in single stranded form. Where the 
sequence is naturally present in single stranded form, denaturation will not be required. 
However, where the sequence is present in double stranded form, the sequence will be denatured. 
Denaturation can be carried out by various techniques known in the art. Subsequent to 
denaturation, the analyte nucleic add and probe are incubated under conditions which promote 
stable hybrid formation of the target sequence in the probe with the putative targeted sequence in 
the analyte, and the resulting duplexes containing the probe(s) are detected. 

Detection of the resulting duplex, if any, is usually accomplished by the use of labeled 
probes; alternatively, the probe may be unlabeled, but may be detectable by specific binding with 
a ligand which is labeled, either directly or indirectly. Suitable labels, and naethods for labeUng 
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probes and ligands are known in the art, and include, for example, radioactive labels which may 
be incorporated by known methods {e.g., nick translation or kinasing), biotin, fluorescent groups, 
chemiluminescent groups {e.g. , dioxetanes, particularly triggered dioxetanes), enzymes, 
antibodies, and the like. 

The region of the probes which are used to bind to the analyte can be made completely 
complementary to the SARSV genome. Therefore, usuaUy high stringency conditions are 
desirable in order to prevent false positives. However, conditions of high stringency should only 
be used if the probes are complementary to regions of the viral genome which lack 
heterogeneity. The stringency of hybridization is determmed by a number of factors during 
hybridization and during the washing procedure, including temperature, ionic strength, length of 
time, and concentration of formamide. These factors are outUned in, for example, Maniatis T. 
(1982). 

Variations of this basic scheme which are known in the art, including those which facilitate 
separation of the duplexes to be detected from extraneous materials and/or which amphfy the 
signal from the labeled moiety, may also be used. A number of these variations are reviewed in, 
for example: Matthews & Kricka (1988), Analytical Biochemistry 169:1; Landegren etal. 
(1988), Science 242:229; and Mittlin (1989), Clinical Chem. 35:1819. These and the following 
publications describing assay formats are hereby incorporated by reference herein. Probes 
suitable for detectmg SARSV in these assays are comprised of sequences which hybridize with 
target SARSV polynucleotide sequences to form duplexes with the analyte strand, wherein the 
duplexes are of sufficient stability for detection in the specified assay system. 

A suitable variation is, for example, one which is described in US Pat. No. 4,868,105, 
issued Sep. 9, 1989, and in EPO Pubhcation No. 225,807 (published Jun. 16, 1987). These 
publications describe a solution phase nucleic acid hybridization assay in which the analyte 
nucleic acid is hybridized to a labeling probe set and to a capturing probe set. The probe-analyte 
complex is coupled by hybridization with a solid-supported capture probe that is complementary 
to the capture probe set. This permits the analyte nucleic acid to be removed fiom solution as a 
solid phase complex. Having the analyte in the form of a solid phase complex facilitates 
subsequent separation steps in the assay. The labeling probe set is complementary to a labeled 
probe that is bound through hybridization to the solid phase/analyte complex. 

The polymerase chain reaction (PGR) is a technique for amplifying a desired nucleic acid 
sequence (target) contained in a nucleic acid or mixture thereof. In PGR, a pair of primers are 
employed in excess to hybridize to the complementary strands of the target nucleic acid. The 
primers are each extended by a polymerase using the target nucleic acid as a template.. The 
extension products become target sequences themselves, following dissociation from the original 
target strand. New primers then are hybridized and extended by a polymerase, and the cycle is 
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repeated to geometricaUy increase the number of target sequence molecules. PGR is disclosed in 
US Pat Nos. 4,683,195 and 4,683,202. which are incorporated herein by reference. 

The ligase Chain Reaction (LCR) is an alternate method for nucleic acid amplification. In 
LCR, probe pairs are used which include two primary (first and second) and two secondary (third 
and fourth) probes, all of which are employed in molar excess to target. The first probe 
hybridizes to a first segment of the target strand, and the second probe hybridizes to a second 
segment of the target strand, the first and second segments being contiguous so that the primary 
probes abut one another in 5' phosphate-3' hydroxyl relationship, and so that a ligase can 
covalently fuse or Ugate the two probes into a fiised product. In addition, a third (secondary) 
probe can hybridize to a portion of the first probe and a fourth (secondary) probe can hybridize 
to a portion of the second probe in a similar abutting fashion. Of course, if the target is initially 
double stranded, the secondary probes also will hybridize to the target complement in the first 
instance. Once the Ugated strand of primary probes is separated fi-om the target strand, it will 
hybridize with the third and fourth probes which can be ligated to form a complementary, 
secondary ligated product. It is important to realize that the ligated products are functionally 
equivalent to either the target or its complement. By repeated cycles of hybridization and 
ligation, amplification of the target sequence is achieved. This technique is described more 
completely in EP-A-320 308 to K. Backman published Jun. 16, 1989 and EP-A-0439182 to K. 
Backman a/., published Jul. 31, 1991, both of which are incorporated herein by reference. 

For amplification of mRNAs, it is within the scope of die present invention to reverse 
transcribe mRNA into cDNA foUowed by polymerase chain reaction (RT-PCR); or, to use a 
single enzyme for both steps as described in US Pat. No. 5,322.770, which is incorporated herein 
by reference; or reverse transcribe mRNA into cDNA followed by asymmetric gap ligase chain 
reaction (RT-AGLCR) as described by R. L. Marshall et al. , PGR Methods and Applications 
4:80-84 (1994), which also is incorporated herein by reference. 

TMA is described in detail in, e.g. , US Patent No. 5,399,491, the disclosure of which is 
incorporated herein by reference in its entirety. In one example of a typical assay, an isolated 
nucleic acid sample, suspected of containing a S ARSV target sequence, is mixed with a buffer 
concentrate containing the buffer, salts, magnesium, nucleotide triphosphates, primers, 
dithiothreitol, and spermidine. The reaction is optionally incubated at about 100°C for 
approximately two minutes to denature any secondary structure. After cooling to room 
temperature, reverse transcriptase, RNA polymerase, and RNAse H are added and the mixture is 
incubated for two to four hours at 37°C. The reaction can then be assayed by denaturing the 
product, adding a probe solution, incubating 20 minutes at 60°C, adding a solution to selectively 
hydrolyze die unhybridized probe, incubating the reaction six minutes at 60°C, and measuring 
the remaining chemdluminescence in a luminometer. 
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GeneraUy, TMA includes the following steps: (a) isolating nucleic acid, including RNA, 
from the biological sample of interest suspected of being infected with SARS V; and (b) 
combining into a reaction mixture (i) the isolated nucleic acid, (ii) first and second 
oligonucleotide primers, the first primer having a complexing sequence sufficiently 
5 complementary to the 3' terminal portion of an RNA target sequence, if present (for example the 
(+) strand), to complex therewith, and the second primer having a complexing sequence 
sufficiently complementary to the 3* terminal portion of the target sequence of its complement 
(for example, the (-) strand) to complex therewith, wherein the first oligonucleotide further 
comprises a sequence 5' to the complexing sequence which includes a promoter, (iii) a reverse 

10 transcriptase or RNA and DNA dependent DNA polymerases, (iv) an enzyme activity which 

selectively degrades -the RNA strand of an RNA-DNA complex (such as an RNAse H) and (v) an 
RNA polymerase which recognizes the promoter. 

• The components of the reaction mixture may be combined stepwise or at once. The 
reaction mixture is incubated under conditions whereby an oligonucleotide/target sequence is 

15 formed, including DNA priming and nucleic acid synthesizing conditions (including 

ribonucleotide triphosphates and deoxyribonucleotide triphosphates) for a period of time 
sufficient to provide multiple copies of the target sequence. The reaction advantageously takes 
place under conditions suitable for maintaining the stability of reaction components such as the 
component enzymes and witiiout requiring modification or manipulation of reaction conditions 

20 during the course of the amplification reaction. Accordingly, the reaction may take place under 
conditions that are substantially isothermal and include substantially constant ionic strength and 
pH. The reaction convenientiy does not require a denaturation step to separate the RNA-DNA 
complex produced by the first DNA extension reaction. 

Suitable DNA polymerases include reverse transcriptases, such as avian myeloblastosis 

15 virus (AMV) reverse transcriptase (available fi-om, e.g. , Seikagaku America, Inc.) and Moloney 
murine leukemia virus (MMLV) reverse tiranscriptase (available from, e.g. , Bethesda Research 
Laboratories). 

Promoters or promoter sequences suitable for incorporation in the primers are nucleic acid 
sequences (eitiier naturally occurring, produced synthetically or a product of a restiiction digest) 

SO that are specifically recognized by an RNA polymerase that recognizes and binds to that 

sequence and initiates the process of timiscription whereby RNA faranscripts are produced. The 
sequence may optionally include nucleotide bases extending beyond the actual recognition site 
for the RNA polymerase which may impart added stabiUty or susceptibility to degradation 
processes or increased ti-anscription efficiency. Examples of useful promoters include those 

i5 which are recognized by certain bacteriophage polymerases such as tiiose from bacteriophage 
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T3, T7 or SP6, or a promoter from E. coli. These RNA polymerases are readily available jBrom 
commercial somjces, such as New England Biolabs and Epicentre. 

Some of the reverse transcriptases suitable for use in the methods herein have an RNAse H 
activity, such as AMV reverse transcriptase. It may, however, be preferable to add exogenous 
5 RNAse H, such as E. coli RNAse H, even when AMV reverse transcriptase is used. RNAse H is 
readily available from, e.g., Bethesda Research Laboratories. 

The RNA transcripts produced by these methods may serve as templates to produce 
additional copies of the target sequence through the above-described mechanisms. The system is 
autocatalytic and amplification occurs autocatalytically without the need, for lepeatedly 
10 modifying or changing reaction conditions such as temperature, pH, ionic strength or the like. 

Detection may be done using a wide variety of methods, including direct sequencing, 
hybridization with sequence-specific oligomers, gel electrophoresis and mass spectrometry, 
these methods can use heterogeneous or homogeneous formats, isotopic or nonisotopic labels, as 
well as no labels at all. 

15 Suitable labeling moieties for attachment to primers and/or to probes used in methods of 

the invention include, but are not limited to: 5-FAM (also called 5-carboxyfIuorescein; also 
called Spux>(isobenzofuran-l(3H), 9'-(9H)xanthene)-5-carboxylic acid,3',6'-dihydroxy-3-oxo-6- 
carboxyfluorescein); 5-Hexachloro-Fluorescein ([4,7,2',4',5',7'-hexachloro-(3',6'- 
dipivaloylfluoresceinyl)-6-carboxylic acid ]); 6-Hexachloro-Fluorescein ([4,7,2',4',5',7'- 

20 hexachloro-(3',6'-dipivaloylfluoresceinyl)-5-carboxylic acid ]); 5-Tetrachloro-Fluorescein 
([4,7,2',7'-tetrachloro-(3',6'-dipivaloylfluoresceinyl)-5- carboxylic acid ]); 6-Tetrachl6ro- 
Huorescein ([4,7,2',7'-tetrachloro-(3',6'-dipivaloylfluoresceinyl)-6- carboxylic acid ]); 
tetramethylrhodamines (TAMRA), including (i) 5-TAMRA (5-carboxytetramethylrhodamine; 
Xanthylium, 9.-(2,4-dicarboxyphenyl)-3,6- bis(dimethylamino) and (ii) 6-TAMRA (6- 

15 carboxytetramethylrhodamine; Xanthylium, 9-(2,5-dicarboxyphenyl)-3,6- bis(dimethylamino); 
EDANS (5-((2-aminoethyl)amino)naphthalene- 1-sulfonic acid) ; 1,5-IAEDANS (5-((((2- 
iodoacetyl)amino)ethyl) amino)naphthalene-l-sulfonic acid); DABCYL (4-((4- 
(dimethylamino)phenyl) azo)benzoic add); Cy5 (Indodicarbocyamne-5); Cy3 
(Indodicarbocyanine-3); and BODIPY™ FL (4,4-difluoro-5,7-dimethyl-4-bora-3a,4a-diaza-s- 

10 indacene-3-propionic acid). Labelling of probes with both FAM (fi.g. at 5 ;) and TAMRA (e.g. at 
3') is preferred. 

Nucleic acids of the invention may be used in solution or may be bbund to a solid matrix or 
support e.g. in the format of a DNA array, 

As is readily apparent, design of the assays described herein are subject to a great deal of 
i5 variation, and many formats are known in the art. The above descriptions are merely provided as 
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guidance and one of sMU in the art can leadily modify the described protocols, using techniques 
well known in the art. 

One 302nt amplicon of the SARS virus is known as "BNI-1" (SEQ ID NO: 9927). It was 
sequenced at the Bemhard Nocht Institute, Hamburg, Germany. In April 2003 the BNI-1 
sequence was published on the WHO website ihttp://mm.y,ho.mt/csr/sars/pnmers/e,^ and in 
Dorsten et al., "Identification of a Novel Coronavirus in Patients with Severe Acute Respiratory 
Syndrome". New England Journal of Medicine, pubUshed online at http://mm.nejm.org. Both 
references are incorporated herein by reference in their entirety. Some embodiments of tfie 
invention do not encompass a nucleic acid consisting of SEQ ID NO: 9927. Some other 
embodiments of the invention do not encompass a nucleic acid comprising SEQ ID NO: 9927. 
Some embodiments of tiie invention do not encompass a polypeptide consisting of any one of 
SEQ ID NO^: 9928 to 9959. Some otiier embodiments of tiie invention do not encompass a 
nucleic acid comprising any one of SEQ ID NO^- 9928 to 9959. Some embodiments of tiie 
invention are not subject to these exclusions. 
Immunoassays 

The present invention utiHzes various immunoassay techniques for identifying individuals . 
exposed to SARSV and/or biological samples containing SARSV antigens or antibodies to 
SARSV. 

Immunoassay Formats 

The SARSV antigens may be employed in virtiially any assay format tiiat employs a 
known antigen to detect antibodies. A common feature of all of tiiese assays is tiiat tiie antigen is 
contacted with biological sample suspected of containing SARSV antibodies under conditions 
ttiat permit the antigen to bind to"any such antibody present in tfie component. Such conditions 
will typically be physiologic temperahire, pH and ionic strength using an excess of antigen. The 
incubation of the antigen witii tiie specimen is followed by detection of immune complexes 
comprised of tiie antigen. Alternatively, anti-S ARS V antibodies may be employed to detect tiie 
presence of SARSV antigens in a biological sample. Combination antigen/antibody assays are 
also contemplated; for example, as described for HCV detection in US patent 6,630,298. 

Design of tiie immunoassays is subject to a great deal of variation, and many formats are 
known in tiie art. Protocols may, for example, use solid supports, or immunopiecipitation. Most 
assays involve the use of labeled antibody or polypeptide; tiie labels may be. for example, 
enzymatic, fluorescent, chemiluminescent. radioactive, or dye molecules. Assays which amplify 
tiie signals from tiie immune complex are also known; examples of which are assays which 
utilize biotin and avidin, and enzyme-labeled and mediated immunoassays, such as EUSA 
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The immunoassay may be, without limitation, in a heterogeneous or in a homogeneous 
fonnat, and of a standard or competitive type. In a heterogeneous format, the polypeptide is 
typically bound to a solid matrix or support to facilitate separation of the sample from the 
polypeptide after incubation. Examples of solid supports that can be used are nitrocellulose {e.g., 
in membrane or microtiter well form), polyvinyl chloride {e.g., in sheets or microtiter wells), 
polystyrene latex {e.g., in beads or microtiter plates, polyvinylidine fluoride, diazotized paper, 
nylon membranes, microchips, high or low density biochips, recombinant immunoassays 
(RIBA), microfluidity devices, micromagnetic beads, activated beads, and Protein A beads. For 
example, Dynatech Immunlon or Immunlon 2 microtiter plates or 0.25 inch polystyrene beads 
CPrecision Plastic Ball) can be used in the heterogeneous format. The solid support containing 
the antigenic polypeptides is typically washed after separating it from the test sample, and prior 
to detection of bound antibodies. Both standard and competitive formats are known in the art. 

In a homogenous format, the test sample is incubated with the combination of antigens in 
solution. For example, it may be under conditions that will precipitate any antigen-antibody 
complexes which are formed. Both standard and competitive formats for these assays are known 
in.the art. 

In a standard format, the amount of SARSV antibodies in the antibody-antigen complexes 
is directly monitored. This may be accomplished by determining whether labeled anti- 
xenogeneic {e.g., anti-human) antibodies which recognize an epitope on anti-SARSV antibodies 
will bind due to complex formation. In a competitive format, the amount of SARSV antibodies 
in the sample is deduced by monitoring the competitive effect on the binding of a known amount 
of labeled antibody (or other competing ligand) in the complex. 

Complexes formed comprising anti-SARSV antibody (or in the case of competitive assays, 
the amount of competing antibody) are detected by any of a number of known techniques, 
depending on the format. For example, unlabeled SARSV antibodies in the complex may be 
detected using a conjugate of antixenogeneic Ig complexed with a label, (e.g., an enzyme label). 

In an immunoprecipitation or agglutination assay format the reaction between the SARSV 
antigens and the antibody forms a network that precipitates from the solution or suspension and 
forms a visible layer or film of precipitate. If no anti-SARSV antibody is present in the test 
specimen, no visible precipitate is formed. 

There are at least three specific types of particle agglutination (PA) assays. These assays 
are used for the detection of antibodies to various antigens when coated to a support. One type of 
this assay is the hemagglutination assay using red blood cells (RBCs) that are sensitized by 
passively adsorbing antigen (or antibody) to the RBC. The addition of specific antigen antibodies 
present in the body component, if any, causes the RBCs coated with the jpurified antigen to 
agglutinate. 
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To eliminate potential non-specific reactions in the hemagglutination assay, two artificial 
caniers may be used instead of RBC in the PA. The most common of these are latex particles. 
However, gelatin particles may also be used. The assays utihzing either of these carriers are 
based on passive agglutination of the particles coated with purified antigens. 

The S ARS V antigens will typically be packaged in the form of a kit for use in these 
immunoassays. The kit will normally contain in separate containers the native SARS V antigen, 
control antibody formulations (positive and/or negative), labeled antibody when the assay format 
requires same and signal generating reagents (e.g., enzyme substi^te) if the label does not 
generate a signal directly. The native SARS V antigen may be already bound to a solid matrix or 
separate with reagents for binding it to tiie matiix. Instructions (e.g., written, tape, CD-ROM, 
etc.) for carrying out the assay usually wiU be included in the kit. 

Immunoassays that utilize the native SARSV antigen are additionally useful in screening 
blood for the preparation of a supply from which potentially infective SARSV is lacking. The 
method for the preparation of the blood supply comprises the following steps. Reacting a body 
15 component, preferably blood or a blood component, from the individual donating blood with 
native SARSV antigen to allow an immunological reaction between SARSV antibodies, if any, 
and the SARSV antigen. Detecting whether anti-S ARSV antibody~S ARSV antigen complexel 
are formed as a result of tiie reacting. Blood contributed to the blood supply is from donors tiiat 
do not exhibit antibodies to the native SARSV antigens. 
10 Production of Antibodies 

As explained above, the assay may utiUze various antibodies which may be bound to a 
sohd support, and tiiat detect antigen or antigen/antibody complexes formed when SARSV 
infection is present in the sample. These antibodies may be polyclonal or monoclonal antibody, 
preparations, monospecific antisera. human antibodies, or may be hybrid or chimeric antibodies, 
15 such as humanized antibodies, altered antibodies. F(ab')2 fragments, F(ab) fragments, Fv 
fragments, siiigle-domain antibodies, dimeric or trimeric antibody fragment constructs, 
minibodies, or functional fragments thereof which bind to the antigen in question. 

Antibodies are produced using techniques well known to those of skill in the art and 
disclosed in, for example, US Pat. Nos. 4,011,308; 4,722,890; 4,016,043; 3,876.504; 3,770,380; 
0 and 4,372,745. For example, polyclonal antibodies are generated by immunizing a suitlble' 
animal, such as a mouse, rat. rabbit, sheep or goat, with an antigen of interest. In order to 
enhance immunogenicity. the antigen can be linked to a carrier prior to immunization. Such 
carriers are well known to those of ordinary skill in the art. Immunization is generally performed 
by mixing or emulsifying the antigen in saline, preferably in an adjuvant such as Freund's 
5 complete adjuvant, and injecting the mixture or emulsion parenterally (generally subcutaneously 
or intramuscularly). The animal is generally boosted 2-6 weeks later with one or more injections 
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Of the antigen in saline, preferably using Freund's incomplete adjuvant. Antibodies may also be 
generated by in vitro immunization, using methods known in the art. Polyclonal antiserum is 
then obtained fix>m the immunized animal. 

Monoclonal antibodies are generally prepared using the method of Kohler & Milstein 
(1975) Nature 256:495-497, or a modification thereof, as described above. 

As explained above, antibody fragments which retain the ability to recognize the antigen of 
interest, wiU also find use in the subject immunoassays. A number of antibody fragments are 
known in the art which comprise antigen-binding sites capable of exhibiting immunological 
binding properties of an intact antibody molecule. For example, functional antibody fragments 
can be produced by cleaving a constant region, not responsible for antigen binding, from the 
antibody molecule, using e.g., pepsin, to produce F(ab')2 fragments. These fragments will contain 
two antigen binding sites, but lack a portion of the constant region from each of the heavy 
chains. Similarly, if desired. Fab fragments, comprising a single antigen binding site, can be 
produced, e.g., by digestion of polyclonal or monoclonal antibodies with papain. Functional 
fragments, including only the variable regions of the heavy and light chains, can also be 
produced, using standard techniques such as recombinant production or preferential proteolytic 
cleavage of immunoglobuUn molecules. These fragments are known as Fv. See. e.g. . Libar et al. 
(1972) Proc. Nat. Acad. Sci. USA 69:2659-2662; Hochman et al. (1976) Biochem 15:2706- 
2710; and Ehrlich et al. (1980) Biochem 19:4091-4096. 

A single-chain Fv ("sFv" or "scFv") polypeptide is a covalently Unked VH-VLheterodimer 
which is expressed from a gene fusion including Vh- and VL-encoding genes Unked by a peptide- 
encoding linker. Huston et al. (1988) Proc. Nat. Acad. Sci. USA 85:5879-5883. A number of 
methods have been described to discem and develop chemical structures (linkers) for converting 
the naturally aggregated, but chemically separated, light and heavy polypeptide chains from an 
antibody V region into an sFv molecule which will fold into a three dimensional structure 
substantially similar to the structure of an antigen-binding site. See, e.g., US Pat. Nos. 5.091,513, 
5,132.405 and 4.946.778. The sFv molecules may be produced using methods described in the 
art. See, e.g., Huston et al. (1988) Proc. Nat. Acad. Sci. USA 85:5879-5883; US Pat. Nos. 
5.091.513. 5.132.405 and 4.946,778. Design criteria include determining the appropriate length 
to span the distance between the C-terminus of one chain and the N-terminus of the other, 
wherein the linker is generally formed from small hydrophilic amino acid residues that do not 
tend to coil or form secondary structures. Such methods have been described in the art. See, e.g., 
US Pat. Nos. 5,091,513, 5,132.405 and 4,946,778. Suitable linkers generaUy comprise 
polypeptide chains of alternating sets of glycine and serine residues, and may include glutamic 
acid and lysine residues inserted to enhance solubility. 
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"Mini-antibodies" or "minibodies" wiU also find use with the present invention. 
Minibodies are sFv polypeptide chains which include oligomerization domains at their C- 
termini, separated from the sFv by a hinge region. Pack et al. (1992) Biochem 31:1579-1584. 
The oKgomerization domain comprises self-associating a-heUces. e.g., leucine zippers, that can 
. be further stabilized by additional disulfide bonds. The oligomerization domain is designed to be 
compatible with vectorial folding across a membrane, a process thought to facilitate in vivo 
folding of the polypeptide into a ftmctional binding protein. GeneiaUy. minibodies are produced 
using recombinant methods weU known in the art. See, e.g.. Pack et aL (1992) Biochem 
31:1579-1584; Cumber etal (1992) J. Immunology 149B: 120-126. 
Production of SARS Antigens 

The S ARSV antigens used in the present invention are generally produced recombinantiy 
Thus, polynucleotides encoding SARSV antigens for use with the present invention can be made • 
usmg standard techniques of molecular biology. For example, polynucleotide sequences coding • • 
for the above-described molecules can be obtained using recombinant methods, such as by 
screening cDNA and genomic libraries from cells expressing the gene, or by derivingthe gene : 
from a vector known to include the same. Furthermore, the desired gene can be isolated directly 
from viral nucleic acid molecules, using techniques described in the ait, such as tiiose described • 

forHCVinHoughtonefa/..USPat.No.5,350.671.Thegeneencodingtheantigenofinterest . 
can also be produced synthetically, rather than cloned. The molecules can be designed with 
appropriate codons for the particular sequence (preferably optimum codons for the expression ■ 
host of choice). The complete sequence is tiien assembled from overlapping oligonucleotides 
prepared by standard metiiods and assembled into a complete coding sequence. See e g Edge 
(1981) Nature 292:756;NamhaiTetal. (1984) Science 223:1299; and Jay era/. (1984)/ Biol 
Chem. 259:6311. 

Thus, particular nucleotide sequences can be obtained from vectors harboring the desired 
sequences or syntiiesized completely or in part using various oligonucleotide syntiiesis 
techniques known in the art. such as site-directed mutagenesis and polymerase chain reaction 
(PGR) techniques where appropriate. See. e.g., Sambrook. supra. In particular, one method of 
obtaimng nucleotide sequences encoding the desired sequences is by annealing complementary 
sets of overiapping syntiietic oligonucleotides produced in a conventional, automated 
polynucleotide synthesizer, followed by ligation witi, an appropriate DNA ligase and 
amplification of the ligated nucleotide sequence via PGR. See, e.g., Jayaraman et al (1991) 
Proa. Natl. Acad. Set USA 88:4084-4088. Additionally, oligonucleotide directed synthesis 
(Jones et al. (1986) Nature 54:75-82), oligonucleotide directed mutagenesis of pre-existing 
nucleotide regions (Riechmami era;. (1988) //arure 332:323-327 and Verhoeyen er a/. (1988) 
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Science 239:1534-1536), and enzymatic fiUing-in of gapped oUgonucleotides using T4 DNA 
polymerase (Queen et cd. (1989) Proa. Natl Acad. SO. USA 86:10029-10033) can be used under 
the invention to provide molecules having altered or enhanced antigen-binding capabilities, 
and/or reduced immunogenicity. 

Once coding sequences have been prepared or isolated, such sequences can be cloned into 
any suitable vector or repUcon. Numerous cloning vectors aie known to those of skiU in the art, 
and the selection of an appropriate cloning vector is a matter of choice. Suitable vectors include, 
but are not limited to. plasmids, phages, transposons. cosmids, chromosomes (including artificial 
chromosomes, such as B AGs or YACs) or viruses which are capable of replication when 
associated with the proper control elements. 

The coding sequence is then placed under the control of suitable control elements, 
depending on tiie system to be used for expression. Thus, the coding sequence can be placed 
under the control of a promoter, ribosome binding site (for bacterial expression) and, optionally, 
an operator, so that the DNA sequence of interest is transcribed into RNA by a suitable 
transfoimant. The coding sequence may or may not contain a signal peptide or leader sequence 
which can later be removed by the host in post-translational processing. See. e.g., US Pat. Nos. 
4,431,739; 4,425,437; 4,338.397. ' . 

Li addition to control sequences, it may be desirable to add regulatory sequences which^ . 
aUow for regulation of the expression of the sequences relative to the growth of the host cell. 

Regulatory sequences are loiown to those of skiU in the art, and examples include those which 
cause the expression of a gene to be tumed on or off in response to a chemical or physical 
stimulus, including the presence of a regulatory compound. Other types of regulatory elements 
may also be present in the vector. For example, enhancer elements may be used herein to 
increase expression levels of the constructs. Examples include the SV40 early gene enhancer 
(Dijkema et al (1985) EMBO J. 4:761), the enhancer/promoter derived from the long terminal 
repeat (LTR) of the Rous Sarcoma Virus (Gorman et al. (1982) Proc. Natl. Acad. Set. USA 
79:6777) and elements derived from human CMV (Boshart et al. (1985) Cell 41:521), such as 
elements included in the CMV intron A sequence (US Pat. No. 5,688,688). The expression 
cassette may further include an origin of replication for autonomous replication in a suitable host 
cell, one or more selectable markers, one or more restriction sites, a potential for high copy 
number and a strong promoter. 

An expression vector is constructed so that the particular coding sequence is located in the 
vector with the appropriate regulatory sequences, the positioning and orientation of the coding 
sequence with respect to the control sequences being such that the coding sequence is transcribed 
under the "control" of the control sequences (».e., RNA polymerase which binds to the DNA 
molecule at the control sequences transcribes the coding sequence). Modification of the 
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sequences encoding the molecule of interest may be desirable to achieve this end. For example, 
in some cases it may be necessary to modify the sequence so that it can be attached to the control 
sequences in the appropriate orientation; i.e., to maintain the reading frame. The control 
sequences and other regulatory sequences may be ligated to the coding sequence prior to 
insertion into a vector. Alternatively, the coding sequence can be cloned directly into an 
expression vector which already contains the control sequences and an appropriate restriction 
site. 

As explained above, it may also be desirable to produce mutants or analogs of the antigen 
of interest. Methods for doing so are described in, e.g., Dasmahapatra et al, US Pat. No. 
5.843,752 and Zhang et al, US Pat. No. 5,990,276. Mutants or analogs of SARSV proteins for 
use in the subject assays may be prepared by the deletion of a portion of the sequence encoding 
the polypeptide of interest, by insertion of a sequence, and/or by substitution of one or more 
nucleotides within the sequence. Techniques for modifying nucleotide sequences, such as site- 
directed mutagenesis, and the like, are well known to those skilled in the art. See, e.g., Sambrook 
et al, supra; Kunkel, T. A. (1985) Proc. Natl. Acad. Sal USA (1985) 82:448; Geisselsoder et al. : 
(1987) BioTechniques 5:786; Zoller & Smith (1983) Methods Enzymol. 100:468; Dalbie- 
McFarland et al. (1982) Proc. Natl. Acad. Set USA 79:6409. 

The molecules can be expressed in a wide variety of systems, including insect, 
mammalian, bacterial, viral and yeast expression systems, all well known in the art. ' 

For example, insect cell expression systems, such as baculovims systems, are known to 
those of skill in the art and described in. e.g.. Summers & Smith. Texas Agricultural Experiment • ■ 
Station Bulletin No. 1555 (1987). Materials and methods for baculovirus/insect ceU expression 
systems are commercially available in kit form from, inter alia, Invitrogen, San Diego Calif. 
("MaxBac" kit). Similarly, bacterial and mammalian cell expression systems are weU known in 
the art and described in, e.g., Sambrook et al., supra. Yeast expression systems are also known in . 
the art and described in, e.g.. Yeast Genetic Engineering (Barr et al, eds., 1989) Butterworths. 
London. 

A number of appropriate host cells for use with the above systems are also known. For 
example, mammalian cell lines are known in the art and include immortalized cell lines available 
from the American Type Culture Collection (ATCC). such as. but not limited to, Chinese 
hamster ovary (CHO) cells. HeLa cells, baby hamster kidney (BHK) cells, monkey kidney cells 
(COS), human embryonic kidney cells, human hepatocellular carcinoma cells (e.g.. Hep G2) 
Madin-Darby bovine kidney ("MDBK") cells, as well as others. Similarly, bacterid hosts such as 
Kcoli, Bacillus subtilis, and Streptococcus spp.. will find use with the present expression 
constructs. Yeast hosts useful in the present invention include inter alia, Saccharonxyces 
cerevisiae, Candida albicans. Candida maltosa, Hansenula polymorpha. Kluyveromyces fragUis. 
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Kluyveromyces lactis, Pichia guillerimondii, Pichia pastoris, Schizosaccharomyces pombe and 
Yarrowia lipolytica. Insect cells for use with baculovirus expression vectors include, inter alia, 
Aedes aegypti. Autogmpha califonnica, Bombyxmon, Drosophila melanogaster, Spodoptera 
frugiperda, and Trichoplusia ni. 

Nucleic acid molecules comprising nucleotide sequences of interest can be stably 
integrated into a host cell genome or maintained on a stable episomal element in a suitable host 
cell using various gene delivery techniques well known in the art. See, e.g., US Pat. No. 
5,399,346. 

Depending on the expression system and host selected, the molecules are produced by 
growing host ceUs transformed by an expression vector described above under conditions 
whereby the protein is expressed. The expressed protein is theri isolated from the host cells and 
purified. If the expression system secretes the protein into growth media, the product can be 
purified directly from the media. If it is not secreted, it can be isolated from cell lysates. The 
selection of the appropriate growth conditions and recovery methods are within the skill of the 
art. . 

EXAMPLE 

For useful expression of SARS V antigens in Saccharomyces cerevisiae and Pichia 
pastoris, insect cells, and mammalian cells, the foUowing domains are cloned into expression 
vectors as listed in the Table below. The nt sequence numbers are firom the S ARSV sequence of 
SEQIDNO: 1. 

- RNA polymerase la: SARS nt 250-13398 
RNA polymerase lb: SARS nt 13399-21470 

- ORFns.envelope (homologous to ns2, hemagglutinin-esterase envelope glycoprotein, 
and spike glycoprotein): SARS nt 21477-25244 

Membrane: SARS nt 27849 - 28103 

Nucleocapsid: SARS nt 28105 - 29373 
A combination of PGR and synthetic oligos is used to create the above domains with restriction 
sites tailored to the following expression vectors: 

Expression host 
AD3/Saccharomyces 
Al)3/Saccharomyces 
GS 1 15/Pichia pastoris 
HVK-293/Transient 
transfection 
CHO stable cell Hne 
Ceil lines eixployed by Chiron 
include: Sf9, Sf21, Tn5 



Restriction ends 
Hindm/Saa 
EcoRI/Sam 
XbaVSall 

EcoKUBamm 
EcoTO/Xmal 



Vector 
PBS24.1 
pBS24.1 
pAOBlS 

pCMVkm2 
pCMVm 



Promoter 

ADH2/GAPDH 
ADH2/GAPDH/SOD fusion 
AOXI 

CMVp/Enhancer/IntronA 
CMVp/Enhancer/IntronA 



pBluBac4.5 Polyhedrin 
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IV. TREATMENT OF SARS INFECTION WITH RNAi 

RNA interference or "RNAi" is a term initially coined by Fire and co-workers to describe 
the observation that double-stranded RNA (dsRNA) can block gene expression when it is 
' introduced into worms (Fire et al. Nature 391, 806-811(1998)). RNAi most likely involves 
5 mRNA degradation, resulting in sequence-specific, post-trariscriptional gene silencing in many 
organisms. RNAi is a post-transcriptional process triggered by the introduction of double- 
stranded RNA which leads to gene silencing in a sequence-specific manner. RNAi has been . 
reported to occur naturally in organisms as diverse as nematodes, trypanosmes, plants and fungi. 
It most likely serves to protect organisms from viruses, modulate transposon activity and 

10 eliminate aberrant transcription products. 

The first evidence that dsRNA could achieve efficient gene silencing through RNAi came 
from studies on the nematode Caenorlmbditis elegans (Fire et al (1998) Nature, 391:806-811 
and US Patent No. 6,506,559). Later studies in the fruit fly Drosophila melanogaster 
demonstrated that RNAi is a two-step mechanism (Elbashir et al (2001) Genes Dev., 15(2): 188- 

15 200). First, long dsRNAs are cleaved by an enzyme known as Dicer in 21-23 nucleotides (nt) 

fragments, called small interfering RNAs (siRNAs). Then, siRNAs associate with a ribonuclease 
complex (termed RISC for RNA Induced Silencing Complex) which target this complex to 
coniplementary mRNAs. RISC then cleaves the targeted mRNAs opposite the complementary 
siRNA, which makes the mRNA susceptible to other RNA degradation pathways. 

20 RNAi is the phenomenon where dsRNA corresponding to a targeted DNA or RNA 

sequence can suppress or silence gene expression. Even though dsRNA can mediate gene- 
specific interference in mammaUan cells in some circumstances (Wianny & Zemicka-Goetz 

(2000) Nature Cell Biol. 2:70-75; Svoboda et al. (2000) Development 17:4147-4156) the use of 
RNAi in manmialian somatic cells is often limited due to the dsRNA triggering dsRNA- 

25 dependent protein kinase (PKR) which in turn inactivates translation factor eIF2a and causes a 
generalized suppression of protein synthesis and often times apoptosis (Gil & Esteban (2000) 
Apoptosis 5 :107-114). 

Recently, gene-specific suppression using siRNA of approximately 21 or 22 base pairs in 
length, corresponding to targeted RNA or DNA sequences, were shown to disrupt the expression 

30 of these targeted sequences in mammalian cells (Elbashir, S.M., et al. , Nature 41 1 : 494-498 

(2001) ). However, it is not clear that all RNA or DNA sequences of a mammalian cell's genome 
are susceptible to siRNA. It is also uncertain that every mammalian cell type possesses the 
necessary machinery for effecting gene-specific suppression using siRNA. Further, siRNA is of 
limited use for at least two reasons: the transient nature of the suppression efiect seen in cells 

35 where the siRNA has been administered; and in some instances the necessity for chemical 

synthesis of siRNAs before their use (Tuschl T., Nature Biotechnol., 20: 446-448 (2002)). Also 
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the instability of these short, synthetic RNAs makes it presents problems for any long term use of 
these siRNAs a pharmaceutical. 

To overcome this limitation, the present invention provides a modified siRNA with 
increased stability against nuclease degradation while still maintaining its ability to inhibit vkal 
replication via RNA interference. Such modification to the ribonucleotides in the siRNAs, adds 
a chemical group via chemical synthesis or in vitro transcription or longer modified RNAs can 
be prepared by either of these methods and cut into siRNAs using Dicer. 

Although other methods for gene-specific suppression have utilized chemically-modified 
nucleic acids, such as antisense and ribozyme technology, such modification destroys critical 
enzymatic activities necessary for the fimction of these technologies. In regard to antisense 
technology, modification of the ribonucleotides destroys RNaseH activity, whereas such 
modification abolishes the catalytic activity of ribozymes. 

Hie present invention provides a double-stranded RNA (dsRNA) molecule modified for 
protection against nuclease degradation with a length from about 10 to about 30 nucleotides 
which is able to inactivate a virus in a mammalian cell. The invention also provides a method of 
inactivating a virus by administering modified small interfering RNAs (siRNAs) that are 
modified so that they are nuclease or RNase resistant and retain the biological activity of being 
able to inhibit viral replication by targeting a RNA sequence in a viras. 

The invention is further directed to a method of making modified siRNAs that target a 
RNA sequence in a virus comprising preparing a modified-double stranded RNA (dsRNA) 
fra^ent containing at least one modified ribonucleotide in at least one strand that spans the 
genome of the virus; and cleaving the modified-dsRNA fragments with recombinant human 
Dicer resulting in more than one modified siRNA. 

The present invention provides a modified dsRNA molecule of from about 10 to about 30 
nucleotides which mediates targeted RNA interference in hepatic or S ARS-infected cells. 

As used herein RNA interference, or RNAi, is used to mean sequence-specific, or gene 
specific, suppression of gene expression (protein synthesis), without causing a generalized 
suppression of protein synthesis in cells harboring the siRNA. The invention is not limited to a 
particular theory of the mechanism of action of RNAi. For example, RNAi may involve 
degradation of messenger RNA (mRNA>in an RNA-induced silencing complex (RISC), 
preventing translation of the transcribed mRNA, or it may involve the methylation of genomic 
DNA, shunting transcription of the gene. The lack of gene expression caused by RNAi may be 
transient, lasting a short period of time, or it may be stable, or permanent, lasting an indefinite 
period of time. 

The term RNA is meant as is recognized in the art. Further, as used herein, RNA is used to 
mean double-stranded RNA (dsRNA) or single-stranded RNA (ssRNA) or a dsRNA with a 
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single-stranded overhang. dsRNAs within the meaning of the present invention includes short 
mterfering RNA (siRNA), micro RNA (miRNA) and small hairpin RNA (shRNA), 
Additionally, RNA is also used to mean messenger RNA (mRNA), transfer RNA (IRNA) or 
ribosomal RNA (rRNA), 
5 The present invention is directed to small interfering RNA (siRNA) which have been 

chemically modified to confer increased stability against nuclease degradation yet these siRNAs 
are stiU able to bind to target RNAs. that may be present in a cells. In the case where the target 
RNA is a virus specific RNA, the modified siRNAs are able to bind to the virus specific RNAs 
and inactivate the virus. A modified siRNA of the present invention comprises a modified 
10 ribonucleotide, wherein the siRNA is resistant to enzymatic degradation, such as RNase 

degradation, and yet retains the ability to inhibit viral replication. The modified siRNA is more- 
specificaUy modified at the 2' position of the ribose in the siRNA. The modification is at the 2' 
position of at least one ribonucleotide of said siRNA. Attachment of receptor-binding ligands to : 
siRNA molecules can be used to target the siRNA to a desired cell type. For example, 
15 attachment. of cholesterol at the 5'-end or 3'-end of the siRNA molecule, to give a cholesteryl • 
siRNA, can enhance targeting to hepatocytes. Other ligands for receptor mediated siRNA 
targeting to liver include HBV surface antigen, LDL, and others. 

More specifically, the siRNA is modified at at least one pyrimidine, at least one purine or a 
combination thereof. However, generally aU pyrimidines, or all purines or a combination of all 
20 pyrimidines and all purines of the SiRNA are modified. More preferably, the pyrimidines are 
modified and these pyrimidines are cytosine, a derivative of cytosine, uracil, a derivative of 
uracil or a combination thereof. It also is contemplated to modify the selected ribonucleotides in^ 
at least one sti:and of the siRNA or the ribonucleotides in both sti^ds of the siRNA are 
modified. 

15 The nucleotides containing pyrimidine bases found in RNA (cytidine and uridine) can be 

chemically modified by adding any molecule that inhibits RNA degradation or breakdown to the 
2- position of the ribose molecule. The 2'-modified pyrimidine nucleotide can be formed using a 
number of different methods. The 2' modification confers increased stability to the siRNA by ■■ 
making the siRNA impervious or resistant to nuclease activity. Thus, the 2' modified siRNA has 

JO a longer serum half-life and is resistant to degradation compared to unmodified siRNA. The 
siRNA also may be modified completely or partially. 

Regarding chemical modification of siRNAs, a molecule from the halide chemical group is 
preferably added to the ribonucleotide of the siRNA. Within the halides, fluorine is the preferred 
molecule but other chemical molecules, in addition to fluoro-, such as methyl-, methoxyethyl- 

15 and propyl-modifications can also we made. But the preferred medications is fluoro- 

modification, such as a 2'-fluoro-modication or a 2',2'-fluoio-modification.Thus, in a preferred 
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embodiment of the invention, the siRNA is modified by adding a fluorine molecule to the T 
carbon of the pyrimidine ribonucleotide. The siRNA may be fluorinated completely or partially. 
For example, only the cytosine nucleotides need be fluorinated. Alternatively, only the uracil 
nucleotide need be fluorinated but both uracil and cytosine can be fluorinated. Furthermore, 
only one strand, either sense or antisense, of the siRNA can be fluorinated. Even partial 2' 
fluorination the siRNA gives protection against nucleolytic degradation. Furthermore, it is 
important to note the 2* fluorinated siRNA is not toxic to cells, an unexpected result given that 
fluorine chemistry usually is toxic to living organisms. 

The siRNA of the present invention is designed to interact with a target nucleotide 
sequence. Most preferably this target nucleotide sequence is a disease producing agent or 
• pathogen of which one wishes to inhibit gene expression. More preferably, this target nucleotide 
sequence is in a virus genome, and further this virus genome is from a RNA virus or a DNA 
virus is selected from the group consisting of hepatitis C virus (HCV), hepatitis A virus, hepatitis 
B virus, hepatitis D virus, hepatitis E virus, Ebola virus, influenza viruSi rotavirus, reovirus, 
retrovirus, poliovirus, human papilloma virus (HPV), metapneumovirus and coronavirus. The 
most preferred virus is SARS virus. 

Modfied siRNA may be prepared in a nmnber of ways, such as by chemical synthesis, T7 
polymerase transcription, or by treating modified long double stranded RNA (dsRNA) prepared 
by one of the two previous methods with Dicer enzyme. Dicer enzyme can be used to cleave 
dsRNA that is about 500 base pairs to about 1000 base pairs in size, to created mixed 
populations of dsRNA from about 21 to about 23 base paurs in length. Furthermore, an 
unexpected result of using the Dicer enzyme method is that Dicer enzyme will cleave modified 
strands of dsRNA, such as 2' fluorinated modified dsRNA. Before development of this method, 
it was previously thought that Dicer would not be able to cleave modified siRNA. The Dicer 
method can be carried out using the Dicer siRNA Generation Kit available from Gene Therapy 
Systems, San Diego, CA. 

As used herein, small interfering RNA (siRNA) is defined as double- or single-stranded 
RNA of from about 10 to about 30 nucleotides in length, more preferably 12-28 nucleotides, 
more preferably 15-25 nucleotides, even more preferably 19-23 nucleotides and most preferably 
21-23 nucleotides. The length of a siRNA as used herein, is determined by the length of one of 
the strands of the RNA. For example, a siRNA that is described as 21 nucleotides long (a 21- 
mer) may comprise two opposite strands of RNA which anneal together for 19 contiguous base 
pairings. The two remaining nucleotides on one end of the molecule would not anneal to the 
opposite strand, thus creating an "overhang". The overhang can be at the 5' or the 3' end of the 
dsRNA. Preferably, the overhang is at the 3' end of the RNA strand. The length of a double- 
stranded RNA where the two opposite strands are not the same length will be designated by the 
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longer of the two strands. For example, a dsRNA comprising one strand which is 21 nucleotides 
long and anneals to an opposite strand that is 20 nucleotides long, will be considered, as used 
herein, a 21-mer. 

Preferably, the siRNA of the present invention will comprise a 3' overhang of about 2 to 4 
bases. More preferably, the 3' overhang is 2 nucleotides long. Even more preferably, the 2 
nucleotides comprising the 3' overhang are uridine (U). 

In one embodiment, the invention provides an RNA molecule comprising a nucleotide 
sequence at least 80% identical to the nucleotide sequence of the target agent or virus. 
Preferably, the RNA molecule of the present invention is at least 90%, 95%, 96%, 97%, 98%, 
99% or 100% identical to the nucleotide sequence of the target agent or virus. 

As a practical matter, whether any particular nucleic acid molecule is at least 90%, 95%, 
96%, 97% 98%, 99% or 100% identical to the nucleotide sequence of the target agent or virus 
can be determined conventionally using known computer programs such as the Bestftt program 
(Wisconsin Sequence Analysis Package, Version 8 for Unix, Genetics Computer Group, * 
University Research Park, 575 Science Drive, Madison, Wis. 5371 1). Bestfit uses the local 
homology algorithm of Smith & Waterman {Advances in Applied Mathematics 2:482-489 
(1981)) to find the best segment of homology between two sequences. When using Bestfit or any 
other sequence alignment program to determine whether a particular sequence is, for instance, 
95% identical to a reference sequence according to the present invention, the parameters are set, ■. 
of course, such that the percentage of identity is calculated over the fuU length of the reference 
nucleotide sequence and that gaps in homology of up to 5% of the total number of nucleotides in; 
the reference sequence are allowed. 

The present invention provides a method of inactivating a target agent or preferably a virus 
in a patient comprising administering to the patient a modified siRNA in an effective amount to 
inactivate the targeted agent or virus. RNA interference towards a targeted DNA segment in a 
cell can be achieved by administering a dsRNA molecule or siRNA to the cells, wherein the 
nucleotide sequence of the dsRNA molecule corresponds to the nucleotide sequence of the 
targeted DNA segment. Preferably, the RNA molecule used to induce targeted RNAi is siRNA. 

Gene suppression, targeted suppression, sequence-specific suppression, targeted RNAi or 
sequence-specific RNAi are used interchangeably herein. Furthermore, sequence-specific 
suppression, as used herein, is determined by separately assaying the levels of the protein 
targeted for suppression in cells containing the siRNA (experimental ceUs) and in cells not 
containing the identical siRNA (control cells), and comparing the two values. Furthermore, the 
experimental and control cells must be derived firom the same source and same animal. For 
example, the control and experimental cells can be, but are not limited to. normal human hepatic 
cells as ceU culture in vitro, or they can derived from a hepatocellular carcinoma. Further, the 
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control and experimental cells used in determining the level or quantity of gene suppression must 
be assayed under similar, if not identical, conditions. 

As used herein the phrase "targeted DNA segment" is used to mean a DNA sequence 
encoding, in whole or in part, an mRNA for a targeted protein, including introns or exons, where 
suppression is desired. DNA segment can also mean a DNA sequence that normally regulates 
expression of the targeted protein, including but not limited to the promoter of the targeted 
protein. Furthermore, the DNA segment may or may not be a part of the cell's genome or it may 
be extrachromosomal, such as plasmid DNA. 

The present invention is further directed to inactivating a virus in a patient comprising 
administering to a patient a modified siRNA in an effective amount to inactivate the virus. The 
siRNA is preferably about 10 to about 30 nucleotides in length, more preferably 12-28 
nucleotides, more preferably 15-25 nucleotides, even more preferably 19-23 nucleotides and 
most preferably 21-23 nucleotides. The method preferably utilizes a 2' modified siRNA that is 
modified at the 2' position of at least one ribonucleotide of said siRNA. The method utilizes a 
siRNA that is modified with chemical groups selected from the group consisting of fluoro-, 
methyl-, methoxyetiiyl- and propyl-modification. The fluoro-modification is preferred and either 
a 2'-fluoro-modication or a 2',2'-fluoro-modification is useful in the present invention and 
preferred. 

The modification may be at the pyrimidines, the purines or a combination thereof of the 
siRNA are modified. More preferably the pyrimidines are modified, such as cytosine, a 
derivative of cytosine, uracil, a derivative of uracil or a combination thereof. In one 
embodiment, at least one strand of the siRNA contains at least one modified nucleotide and in an 
alternate embodiment, oth strands of the siRNA contains at least one modified nucleotide. 

The method is intended to target disease causing agents or pathogens, an more paiticularly 
viruses, which can be either a RNA virus or a DNA virus, which are selected fix)m the group 
consisting of hepatitis C virus (HCV), hepatitis A vims, hepatitis B viras, hepatitis D virus, 
hepatitis E virus, Ebola virus, influenza virus, rotavirus, reovirus, retrovirus, poliovirus, human 
papilloma virus (HPV), metapneumovims and coronavirus. More preferably the target virus is a 
S ARS virus. The present method utilizes a siRNA prepared by (a) identifying a target nucleotide 
sequence in a viras genome, preferably SARS virus, for designing a small interfering RNA 
(siRNA); and (b) producing a siRNA that has been modified to contain at least one modified 
nucleotide. More preferably, the siRNA comprises a dsRNA molecule with a first strand 
ribonucleotide sequence corresponding to a nucleotide sequence corresponding to a target 
nucleotide sequence in said virus and a second strand comprising a ribonucleotide sequence 
complementary to said target nucleotide sequence, wherein said first and second strands are 
separate complementary stirands that hybridize to each other to form said dsRNA molecule, and 
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(toherwherein the first str^jdribcmuoleoflde sequence. a» second stn«dribo„ucl^^ 

sequ^ce or both a.e firs, and second stnmdribonucletide sequences compri 

modified nucleotide. In this »efl,od. fl» target nucleotide sequence comprises a conserved 

™«=J«'««le««J''encenecessaryforSARSvirusrepHcation.anda,eco„servednucleotide 

5 ^-quenceis selected fiom the group consisting ofSEQ ID NO: 7292. SEQ ID NO- 7293 SEOID 

NO:7294.SEQIDNO:7295.SEQIDNO:7296.SEQIDNO:7297.SBQn>N0.7S8S 

^Na7299.SBQIDNO: 7300 andSEQlBNO: 7301. Pre^erabl^theleotid^^^^^ 

selectedfiomthegroupconsistingofSEQIDNO:7292andSEQIDNO:7293. SHUmore 
preferably, the nucleotide sequence is SEQ ID NO: 7293. 

' descH^T"^ ?f " ""^ ^'"^ ""^fi^" ribonucleotides as 

d<^bedhete.n. Fur*er..henK^edribonucl»^^ 

.s mcorpontted mto satd siRNA by chemical synthesis or enzymatic synthesis 

The S.RNA disclosed in this application may or may not have a 5' triphosphate group 
The modified SiRNA is administered to a pauent by a method selected from the group 

T^emod^edsiRNAaccumulates in an organ, tissue orbody system Of the patient that are a,7 
liver, gastromtestmal tract, respiratory tract, cervix or skin 

as S A^r " 'T"°" '"""^ ' """"""^"^ « virus, such 

asSARS ™.s.mcellspositiyeforSARSviruscomprising.ransfec.ingSARS-posiaveceUs 
^vtU, a vector that directs the expression of modified siRNA which is specific for S ARS. "me 

c^a^evaluatedtodetenmneifamattointhecellshasbeeninUbitedbythemodifi^ 

fte subject can be a pnma.e..ncluding non-human and humans. Il.e terms subject and paUent 
can be used interchangeably. 

-nie treatment envisioned by flte current invention can be used for subjects with a pte- 

methodofthecu.rentmventioncanbeused,oco.xec.orcompensateforceUularor 
physiological abnonnaUties involved in conferring susceptibility to viral infections in patients. 

and/or.oallev.a.esymptomsofavirali„fectioninpatients,orasapreventativemeaslm 

The method of treating a patient having a viral infection involves adminisUaUon of 
capons to thesubjects. As used herein, composition can mean a pute compound, agen, or 
ubstanceoramrxtureoftwoormorecompounds,agentsorsubstances.AsuJherein,the 
erm agent, substance or compound is intended to mean a protein, nucleic acid, carbohydt^te 
lipid, polymer or a small molecule, such as a drug. 
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In one embodiment of the current invention, the composition administered to the subject is 
a pharmaceutica] composition. Further, the pharmaceutical composition can be administered 
orally, nasally, parenterally, intrasystemically, intraperitoneally, topically (as by drops or 
transdermal patch), bucally, or as an oral or nasal spray. The term "parenteral," as used herein, 
5 refers to modes of administration which include intravenous, intramuscular, intraperitoneal, 
intrastemal, subcutaneous and intraarticular injection and infusion. The pharmaceutical 
compositions as contemplated by the current invention may also include a phaimaceutically 
acceptable carrier. 

By "pharmaceutically acceptable carrier" is intended, but not limited to, a non-toxic solid, 
10 semisolid or liquid fiUer, diluent, encapsulating material or formulation auxiliary of any type, 
such as liposomes. 

A pharmaceutical composition of the present invention for parenteral injection can 
comprise pharmaceutically acceptable sterile aqueous or nonaqueous solutions, dispersions, 
suspensions or emulsions as well as sterile powders for reconstitution into sterile injectable 
15 solutions or dispersions just prior to use. Examples of suitable aqueous and nonaqueous carriers, 
diluents, solvents or vehicles include water, ethanol, polyols (such as glycerol, propylene glycol^ 
polyethylene glycol, and the like), carboxymethylcellulose and suitable mixtures thereof, 
vegetable oils (such as olive oil), and injectable organic esters such as ethyl oleate. Proper 
fluidity can be maintained, for example, by the use of coating materials siich as lecithin, by the • 
iO maintenance of the required particle size in the case of dispersions, and by the use of surfactants. 
The compositions of the present invention can also contain adjuvants such as, but not 
limited to, preservatives, wetting agents, emulsifying agents, and dispersing agents. Prevention 
of the action of microorganisms can be ensured by the inclusion of various antibacterial and 
antifungal agents, for example, paraben, chlorobutanol, phenol, sorbic acid, and the like. It can 
15 also be desirable to include isotonic agents such as sugars, sodium chloride, and the like. 
Prolonged absorption of the injectable pharmaceutical form can be brought about by the 
inclusion of agents which delay absorption such as aluminum monostearate and gelatin. 

In some cases, to prolong the effect of the drugs, it is desirable to slow the absorption from 
subcutaneous or intramuscular injection. This can be accomplished by the use of a liquid 
•0 suspension of crystalline or amorphous material with poor water solubility. The rate of 

absorption of the drug then depends upon its rate of dissolution which, in turn, can depend upon 
crystal size and crystalline form. Alternatively, delayed absorption of a parenterally 
administered drug form is accomplished by dissolving or suspending the drug in an oil vehicle. 
Injectable depot forms are made by forming microencapsule matrices of the drug in 
15 biodegradable polymers such as polylactide-polyglycolide. Depending upon the ratio of drug to 
polymer and the nature of the particular polymer employed, the rate of ding release can be 
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controlled. Examples of other biodegradable polymers include poly(orthoesters) and 
poly(anhydrides). Depot injectable formulations are also prepared by entrapping the drug in 
liposomes or microemulsions which are compatible with body tissues. 

The injectable formulations can be sterilized, for example, by filtration through a bacterial- 
5 retaimng filter, or by incorporating sterilizing agents in the form of sterile solid compositions 
which can be dissolved or dispersed in sterile water or other sterile mjectable medium just prior 
to use. ^ 

Solid dosage forms for oral administration include, but are not limited to, capsules' tablets 
puis, powder., and granules. In such soUd dosage forms, the active compounds are mix«l ' 
• at least one item pharmaoeutically acceptable excipient or carrier such as sodium citrate or 
drcalcium phosphate and/or a) fillers or extenders such as starches, lactose, sucmse glucose 
manmtol, and silicic acid, b) binders such as, for example, carboxymethylcellulose, alginates 
gelalm, polyvinylpynOlidone, sucrose, and acacia, c) humectants such as glycerol d)- 
dismtegratrng agents such as agar-agar, calcium carbonate, potato or tapioca starch, alginic acid 
certain sjicates. and sodium carbonate, e) solution retarding agents such as paraffin f) 
absorption accelerators such as quaternary ammonium compounds, g) wetting agen,^ such as. for 
example, acetyl alcohol and glycerol monostearate, h) absorbents such as tooBn and bentomte 
clay, and i) lubricants such as talc, calcium stearate. magnesium stearate. soUd polyethylene 
glycols, sodimn lauryl sulfate, and mixtures thereof. In flre case of capsules, tablets and pills, the 
dosage fonn can also comprise buffering agents. 

Solid compositions of a similar type can also be employed as fillers in soft and hard fiUed 
gelatin capsules using such excipients as lactose or milk sugar as well as high molecular weight 
polyethylene glycols and the like. 

■">«^°«<idosageformsoftabIets,dragees,capsules,pills.andgranulescanbep,epa.ed 
w,th coatings and shells such as enteric coatings and other coatings well known in the 
pharmaceutical formulating art ll,ey can optionaUy contain opacifying agents and can also be 
of a composition U>at they release the active ingredient(s) only, or preferentiaUy, in a certain part 
of the mtesanal ttact, opUonaUy, in a delayed mam,er. Examples of embedding compositions 
which can be used include polymeric substances and waxes. 

The active compounds can also be in micro-encapsulated form, if appropriate, with o«= or 
more of the above-mentioned excipients. 

Liquid dosage forms for oral administration include, but are not limited to 
pharmaceuticaUy acceptable emulsions, solutions, suspensions, syrups and elixir. In addition to 
the active compounds, the liquid dosage forms can contain inert diluents commonly used in the 
art such as. for example, water or other solvents, solubilizing agents and emulsifiers such as 
ethyl alcohol, isopropyl alcohol, etiiyl carix,nate. ethyl acetate, benzyl alcohol, benzyl benzoate. 
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propylene glycol, 1,3-butylene glycol, dimethyl foraiamide, oils (in particular, cottonseed, 
groundnut, com, germ, olive, castor, and sesame oils), glycerol, tetrahydrofurfuryl alcohol, 
polyethylene glycols and fatty acid esters of sorbitan, and mixtures thereof. 

Besides inert diluents, the oral compositions can also include adjuvants such as wetting 
agents, emulsifying and suspending agents, sweetening, flavoring, and perfuming agents. 

Suspensions, in addition to the active compounds, can contain suspending agents as, for 
example, ethoxylated isostearyl alcohols, polyoxyethylene sorbitol and sorbitan esters, 
microcrystalline cellulose, aluminum metahydroxide, bentonite, agar-agar, and tragacanth, and 
mixtures thereof. 

Altematively, the composition can be pressurized and contain a compressed gas, such as 
nitrogen or a liquefied gas propellant. The liquefied propeUant medium and indeed the total 
composition is preferably such that the active ingredients do not dissolve therein to any 
substantial extent. The pressurized composition can also contain, a surface active agent. The 
surface active agent can be a liquid or solid non-ionic surface active agent or can be a solid 
anionic surface active agent. It is preferred to use the soHd anionic surface active agent in the 
form of a sodium salt. 

The compositions of the present invention can also be administered in the form of 
liposomes. As is known in the art, liposomes are generally derived from phospholipids or other 
lipid substances. Liposomes are formed by mono- or multi-lamellar hydrated liquid crystals that 
are dispersed in an aqueous medium. Any non-toxic, physiologically acceptable and 
metaboUzable lipid capable of forming liposomes can be used. The present compositions in 
liposome form can contain, in addition to the compounds of the invention, stabilizers, 
preservatives, excipients, and the like. The preferred Upids are the phosphoHpids and the 
phosphatidyl choHnes (lecithins), both natural and synthetic. Methods to fonn liposomes are 
known in the art (see, for example, Prescott, Ed., Meth. Cell Biol. 14:33 et seq (1976)). 

One of ordinary skill will appreciate that effective amounts of the agents of the invention 
can be determined empirically and can be employed in pure form or, where such forms exist, in 
phaimaceutically acceptable salt, ester or prodrug form. The agents can be administered to a 
subject, in need of treatment of viral infection, as pharmaceutical compositions in combination 
with one or more pharmaceutically acceptable excipients. It will be understood that, when 
administered to a human patient, the total daily usage of the agents or composition of the present 
invention will be decided by the attending physician within the scope of sound medical 
judgement. The specific therapeutically effective dose level for any particular patient will 
depend upon a variety of factors: the type and degree of the cellular or physiological response to 
be achieved; activity of the specific agent Or composition employed; the specific agents or 
composition employed; the age, body weight, general health, sex and diet of the patient; the time 
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Of administration, route of administration, and rate of excretion of the agent; the duration of the 
treatment; drugs used in combination or coincidental with the specific agent; and like factors 
well known in the medical arts. For example, it is well within the skill of the art to start doses of 
the agents at levels lower than those required to achieve the desired therapeutic effect and to 
gradually increase the dosages until the desired effect is achieved. 

Dosing can also be arranged in a patient specific manner to provide a predetermined 
concentration of the agents in the blood, as determined by techniques accepted and routine in the 
art. Thus patient dosaging can be adjusted to achieve regular on-going blood levels, as measured 
by HPLC, on the order of from 50 to 1000 ng/ml. 

It will be readily apparent to one of ordinary skill in the relevant arts that other suitable 
modifications and adaptations to the methods and applications described herein can be made 
without departmg from the scope of the invention or any embodiment thereof. 

The modified siRNA is prepared by custom chemical synthesis by Dharmacon, at 
Lafayette CO. Each C and U within the siRNA duplex (GL2). has been substituted with 2'-F-U ■ 
and 2'-F-C except for the 3'-end overhang, which was dTdT. 

To test the stabihty of 2' chemically modified siRNA compared to unmodified siRNA 
(SiRNA), the following experiment is performed. 4ngs of siRNA are added to a 20 mL volume of 
80% human serum from a healthy donor. This mixture is incubated at 37«C for various times 
ranging from 1 minute up to 10 days. The same process is performed for 2' fluorine modified 
SiRNA (2'-F SiRNA). When the incubation process is finished, the mixtures are placed on ice and 
then immediately separated by PAGE along with a ^^P-siRNA control. The 2' modified siRNA 
is stable as compared to unmodified siRNA. 

y- IDENTIFICATION OF THERAPEUTICAU.Y ACTIVE AGENTS FORTREATAmN^ 
OF SARS VIRUS INFECTION 

The invention provides methods for treating SARS by administering therapeutically active 
agents, such as small molecule compounds, to a mammal, as well as methods of identifying 
therapeutically active agents, such as potent small molecules, for the treatment of SARS virus 
infection. 

In one aspect of the invention a method of identifying a therapeuticaUy active agent is 
provided comprising: (a) contacting the therapeutically active agent with a cell infected with the 
SARS virus; (b) measuring attenuation of a SARS related enzyme. 

In a more particular embodiment, the therapeutically active agent is a small molecule In 
another more particular embodiment, the therapeuticaUy active agent is a nucleoside analog (e.g. 
Ribavirin), m another more particular embodiment the small molecule is a SMIP or peptidic 
immunomodulating compound. In another more particular embodiment the therapeutically 
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active agent is a peptoid, oUgopeptide, or polypeptide. In another embodiment the SARS related 
enzyme is SARS protease. In another embodiment the SARS related enzyme is SARS 
polymerase. In still another embodiment the SARS related enzyme is a kinase. In still another 
embodiment, the SARS related enzyme is a protease. The fiirin inhibitor peptidyl 
chloromethylketone prevents blocks cell-cell fusion after MHV infection (de Haan et at. (2004) J 
Virol), which offers guidance for SARS therapy. 

The invention includes a cell-based assay that can be used to screen for and identify a 
therapeutically active agent for the treatment of SARS virus infection. Therapeutically active 
agents of the invention include agents that inhibit, prevent or reduce the replication of a SARS 
virus. Such agents can be identified by infecting a cultured cell (such as, for example, VERO 
cells) with a SARS virus and evaluating tiie impact of potential antivrrefl compounds on SARS 
virus replication. Assays to measure the effect of a potential antiviral compound on virus 
replication are known in the art and may be based on a variety of parameters. 

The cell-based assay may be used in a high-throughput screen to identify therapeutically 
active compounds from chemical libraries comprising potential antiviral compounds. 
Therapeutically active compounds suitable for use in tiie invention may inhibit any SARS viral 
target that is essential for viral replication in whole cells. Efficacy (the ability of a compound to 
inhibit or inactivate tiie target, be it vural or cellular, tfiat results in tiie reduction of virus in tiie 
culture) of tiie therapeutic agent is measured by assessing the viability and/or the proliferation of 
surviving cells in a SARS virus infected cell culture. 

A number of methods can be used to measure cell viability are known in the art, such as 
assays measuring cellular enzymes, proteins, nucleotide triphoshates (e.g. ATP), nucleic acids 
(e.g. host cell mRNA (e.g. GAPDH) or rRNA sequences) or cellular metabolites such as MTT or 
MTS. Li addition, fluorescent (including, for example HSV paper) or non-fluorescent dyes (e.g. 
propidium diiodide) or labeling of DNA can be used to measure indications of cell viability 
and/or proliferation. 

Alternatively, efficacy of a compound or sample can be determined by dircctiy measuring 
the amount of virus or viral products in the culture. Metiiods for measuring tiie amount of virus, 
viral genome or viral products include: PGR, RT-PCR, TMA, reporter proteins with fluourescent 
or luminescent qualities or enzymatic functions (e.g., luciferase, alkaline phosphatase, GFP) or 
proteins tfiat can be detected by antibodies (e.g. EGF) that might be incoiporated into the viral 
genome prior to infection of the cell culture. Further, viral products such as viral proteins can be 
measured by ELISA or enzymatic activities. Methods for identifying viral polynucleotides, viral 
proteins and antibodies specific to viral proteins are discussed above. 

Potential antiviral compounds are applied to the cell-based assay at a concentration of 
approximately 10 jjM and compound classes having therapeutic effect are identified by 
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measuring the parameter of choice (such as cell viability/proliferation or the virus or viral 
genome or a viral product be it viral in origin or non-virus in orgin). Once compounds are 
identified as having activity, they are resynthesized, and analoged. Starting with the identified 
compound, many analogs and new compounds are synthesized during consecutive optimization 
cycles of synthesis, biological profiling and modeling techniques to optimize the to the lead 
structure until in vivo activity is elucidated and optimized. 

Cells suitable for use in the assay include the cells described above as suitable for vaccine 
production. Preferably, the ceUs are African green monkey kidney ceUs (Vero) cells. Human 
embyronic lung fibroblasts or normal human diploid fibroblasts may also be used in the 
invention. 

In one embodiment, the invention includes a fluorescence based cytopathogenicity assay to 
measure the effect of a potential antiviral compound on a cell-based assay. One example of a 
fluorescence based cytopathogenicity assay is illustrated below. 

1 X lO'* Vero cells per well of a microtiter plate (MTP) are infected with a defined amount 
of SARS virus selected within the foUowing ranges for optimal MOI: 5-10. 10-25, 25-50 50- ■ 
100, 100-500, or 500-1000 PFU in a total volume of 200 ,il media (M199 medium supplemented 
with 5% FCS, 2 mM glutamine, 100 lU/ml penicillin and 100 /*g/ml streptomycin) in the 
presence or absence of the potential antiviral compound and incubated for at least 1, 2, 3, 4, 5, 6, 
or 7 days at 37°C, 5% CO2. The wells of the MTP are washed with PBS (200 /il) a^d thin filled 
with 200 ill PBS containing 10 ^g/ml fluorescein diacetate. After a 45-min incubation at room 
temperature, fluorescence is measured at 485 nm excitation and 538 nm emission wavelengths. - 
IC50 values are determined by a nonlinear plot of antiviral activity as a function of drug 
concentration. 

Other cell based assays are known in the art and include, among others, methods of GFP 
detection and Luc detection. In addition, a Promega kit is commercially available that provides 
additional methods of measuring cell viability, etc. 

In one embodiment, the invention includes a method of measuring the efficacy of a 
potential antiviral compound using RT-PCR to detect the levels of SARS viral RNA in the cell 
based assay. Methods of using RT-PCR are known in the art. One example of such an assay is 
described below. 

5 X 10* Vero cells are seeded in tissue culture. Hasks containing the cells are incubated 
over night at 37°C, 5% CO.. The cells are infected (m.o.i. = 1) with SARS virus in the presence 
and absence of potential antiviral compounds. OptionaUy. the cells may be pietieated with the 
potential compound prior to infection. Li either case, a suitable control cell assay is also 
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The RNA of infected ceUs is purified at 2 h (IIL54), 12 h (UL8) and 16 h (UL13) after 
infection, (Qiagen) RNA purification (ElNeasy kit; 40 fil elution) and quantified (absorption at 
260 nm). The RNA (2 /ig) is reverse transcribed with a specific primer (2 pmol, using one of the 
primer pairs described herein) into cDNA according to theSuperscript II protocol (Invitrogen). 
Aliquots (2 fil) of the reverse transcription reaction are amplified by PGR. Fragments of the 
appropriate target SARS gene, i.e., a gene encoding a SARS enzyme, are amplified in 30 cycles 
(UL54 and UL8: 3 min, 94°C hot start; 1 min, 94°C denaturation; 1 miri, 55°C annealing; 1 imn, 
72 °C polymerization. UH3: 3 min, 94 °C hot start; 1 min, 94 °C denaturation; 1 min, 60 °C 
annealing; 1 min. 72*'C polymerization) by PGR (Taq-Polymerase, Stratagene), in a 100-fil 
reaction volume with the appropriate oligonucleotides, as described herein at 0.1 nmol each. 8-^1 
ahquots of cycle 20-30 Qanes 2-12) of the PGR were resolved on a 2% agarose gel (Invitrogen) 
according to the manufacturer's instractions. 

GeU-based assays of the invention may optionally use a variant or derivative of a wild-type 
SARS virus that has reduced or attenuated virulence in humans and/or animal models (e.g. , 
mouse, non-human primate, etc.) Use of such attenuated SARS viruses in screening methods 
may reduce safety concerns and precautions that would otherwise be associated with the 
pathogenic nature of the SARS virus and may eliminate or reduce the need for the 
implementation of cumbersome high containment levels during performance of the assays and 
screening of compounds. 

The invention includes an enzyme-based assay that can be used to screen for and identify a 
therapeutically active agent for the treatment of SARS virus infection. 

An embodiment of the invention is an assay comprising contacting a known quantity of 
SARS protease in solution to a peptide containing a detectable marker and cleavage sitte for 
SARS protease, wherein SARS protease activity is monitored by measuring the intensity of the 
marker on the cleaved product. 

In a more particular embodiment, a method of assaying for SARS protease is provided 
comprising contacting a sample solution containing SARS protease with a peptide containing a 
fluorescent donor, fluorescent quencher, and cleavage site for SARS protease, said peptide being 
detectable with a fluorometer when cleaved, wherein SARS protease activity is determined in the 
sample by the amount of fluorescence detected by the fluorometer. 

Assays based on the direct measurement of SARS protease inhibition may be utilized for 
screening for SARS therapeutics. Protease for such assays such as 3C-like protease and papain- 
like protease may be isolated and purified for such assays as described in Seybert, et ed., J. Gen. 
Virol., 78:71-75, 1997, Ziebuhr, et al. Adv. Exp. Med. Biol., 440:115-120, 1998, Sims, et al. 
Adv. Exp. Med. Biol. 440:129-134, 1998, Ziebuhr, et al, J. Virol.. 73:177-185, 1999. Teng, et 
al., J. Virol., 73:2658-2666, 1999, Herold, et^l, J. Biol. Chem. 274:14918-14925, 1999, and 
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Ziebuhr, etal.,J. Biol. Chem, 276:33220-33232, 2001. Furthennore, Example 30 describes a 
novel method of purifying SARS protease using column chromatography. Example 31 describes 
a continuous fluorescence resonance energy transfer (FRET) assay for measuring SARS protease 
activity. Protease enzyme based assays such as the FRET assay demonstrated in Example 31 are 
readily adapted for high-throughput screening and are used for screening candidate antiviral 
compounds. Performance of the protease enzymatic assay in the presence of a SARS protease 
inhibitor compound will show a decreased amount of fluorescence at a given time when 
compared to negative control assay containing no test compound on a non-inhibiting control 
compound. Such a method would involve the steps of: (a) providing an assay solution 
comprising SARS protease; (b) adding a test compound to the assay solution; (c) adding a 
substrate for SARS protease to the assay solution; and (d) measuring the proteolytic activity in 
the assay solution. In a preferred embodiment, the proteolytic activity is measured by the 
fluorescence of fluorophore product produced by the enzymatic activity of SARS protease. 

Attenuated SARS virus variants generally contain one or more genome modifications or 
mutations (e.g., substitutions, deletions, insertions) in protein encoding or non-coding regions. 
Specific examples of attenuating mutations include, for example, genetic modifications in the 5.'- 
end noncoding region, leader sequence, intergenic regions, 3'-end noncoding region, ORF la, 
ORF lb, S gene, E gene, M gene, N gene, or any of the nonstructural protein genes outside of the 
ORF la/lb region. Preferred attenuating mutations are in a SARS virus structural protein (e.g.. 
Spike (S)), a protease or polymerase domain, or a non-coding sequence (e.g. , 5'-end noncoding 
region, intergenic sequence). In addition, a cleavage site may be introduced or eliminated within 
the spike protein (see for example, Gombold et al, J. Virol. 67:4504-4512, 1993; Bos et al. 
Virology 214:453-463. 1995), such modification that may also be useful for optimization of • 
expression of recombinant spike protein antigen (e.g. , for vaccine purposes). 

A variety of methods are used according to the present invention in order to obtain 
attenuated variants of SARS virus. Such methods include serial passage of the SARS virus in 
cultured ceUs (e.g., manmialian cell culture, such as fetal rhesus kidney ceUs or VERO cells), 
until the SARS virus demonstrates attenuated function. The serial propagation of virus may be 
performed at any temperature at which tissue culture passage attenuation occurs, and may be 
performed in conjunction with one or more steps of mutagenesis (e.g., chemical mutagenesis). 
The attenuated phenotype of SARS virus variants, obtained after one or more cell culture 
passages, is readily measured by one skilled in the art. As used herein, attenuation refers to the 
decreased virulence of the SARS virus in a human subject. Evidence of attenuated fimction may 
be indicated by decreased levels of viral replication or by decreased virulence in an animal 
model. 
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Other methods of producing an attenuated SARS virus include cell culture passage of the 
virus at sub-optimal temperatures (cold passage), as well as introductioh of attenuating mutations 
into the SARS viral genome by random mutagenesis {e.g., chemical mutagenesis, such as using 
5-fluorouracil) or using directed mutagenesis. Preparation and generation of attenuated RSV 
vaccines (the methods of which will generally applicable to SARS virus) are disclosed in, for 
example, EP 0640128, US Patent No. 6.284,254, US Patent No. 5.922,326, US Patent No. 
5,882.651. 

The number of passages required to obtain safe, immunizing attenuated virus is dependent 
at least in part on the conditions employed. Periodic testing of the SARS virus culture for 
virulence and immunizing ability in animals (e.g., mouse, primate) can readily determine the 
parameters for a particular combination of tissue culture and temperature. 

In another embodiment, the cell-based assay for screening of antiviral compounds is based 
on the readout of expression of a gene product (e.g., reporter gene product) that is not from 
SARS virus. Gene products particularly suitable to the present invention include, but are not 
limited to those of the above-described assays. 

In order to achieve such a read-out, the gene-of-interest (GOI) encoding said gene reporter 
gene product must be incorporated into a replicating SARS virus genome or construct derived 
from a SARS virus genome (eg., SARS virus replicon, SARS virus defective-interfering (DI) 
RNA). Figure 13 is a schematic depicting locations for incorporation of the reporter gene into a 
SARS virus genome. Preferably, insertion of a heterologous reporter gene-of-interest is at a site 
between existing SARS virus genes, such as for example, as shown in Figure 13. For example, 
the GOI may be inserted closely following the termination codon of a SARS virus gene (e.g., 
ORF lb, S, E, M, N). Insertion should be positioned in order to minimize disruption of mRNA 
transcription for the SARS virus gene(s). The GOI may also be inserted as an in-frame "fusion" 
with an existing SARS virus gene, such that sufficient function of the GOI is maintained for 
detection. To optimize expression, an additional SARS virus intergenic sequence {e.g., SEQ ID 
NO: 7388, with or without additional flanking SARS virus sequences) may also be engineered 
into a position preceding the inserted GOI. 

Incorporation of a GOI into SARS virus may be accomplished by one of skill in the art 
using a variety of techniques. For example, one preferred method is targeted RNA 
recombination, that takes advantage of the ability of coronavirus RNAs to undergo 
recombination within the cell (see for example Fischer et al, J. Virol. 71:5148-5160, 1997; 
Koljesar et al, J. Vet. Sci. 2:149-157, 2001). A construct of desired configuration {e.g., cDNA 
of defective interfering RNA of SARS virus) containing the GOI flanked by SARS virus 
sequence {e.g., intergenic sequence) is generated such that RNA may be;transcribed directly 
within a eukaryotic cell or in vitro and transfected into susceptible cells also infected with SARS 
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viras. Recombinant virus containing the GOI is identified based on expression of the GOI 
encoded marker. 

Altematively, incorporation of a GOI into S ARS virus may be accomplished by one of 
skill in the art by first assembling a full-length cDNA clone of the SARS virus, that can be used 
5 to produce infectious RNA transcripts in vivo (e.g. , from an RNA polymerase n promoter) or in 
vitro {e.g., from a bacteriophage promoter). Although relatively long in genome length, such 
assembly of a full-length cDNA clone is now readily obtainable by one of skill in the art using 
standard molecular biology and reverse genetics techniques and the genome sequence of SARS 
virus (see for example, Thiel et al, J. Gen. Virol., 82:1273-1281, 2001; Almazan etal, Proc. 
10 Natl. Acad. Sci. USA 97:5516-5521, 2000; Thiel et al (2003) / Gen Virol 82:1273-1281; Yount 
et al (2003) PNAS USA 100:12995-13000). Insertion of a heterologous GOI into a full-length 
SARS virus genome cDNA may be performed using a variety of techniques, such as for 
example, ligation into natural or synthetic restriction sites, PGR {e.g., overlapping PGR), and 
recombination. 

15 It may also be desirable to utilize similar SARS virus recombinants containing a gene-of- 

interest for antiviral screening, however, with further modification to minimize or eliminate 
virus-induced cytopathology {e.g., CPE). Non-cytopathic derivatives from SARS virus may be 
obtained by one of skill in the art using a variety of methods. For example, a selectable marker 
(e.g., drug resistance marker) may be incorporated as GOI into a SARS virus genome to produce 
20 infectious virus as described above (see for example, Perri et al, J. Virol., 74:9802-9807, 2000). 
Infectious GOI-containing SARS virus or infectious genome RNA/cDNA is then used to 
infect/transfect cells {e.g., VERO), with or without prior mutagenesis, after which time the 
infected cells are subjected to the appropriate selection. Only those cells containing SARS virus 
harboring both the selectable marker and one or more mutations rendering the virus non- 
25 cytopathic will survive the selection process and grow out. Active SARS virus replication in 
these cells is readily detected using a variety of detection techniques {e.g. , PGR, Northern blot) 
and such cells may serve as the substrate for cell-based screening assays. Mutations that result in 
the desired noncytopathic SARS virus phenotype may include nucleotide substitutions, deletions 
or additions, and may occur in a variety of genome coding or non-coding regions {e.g., 5' or 3'- 
30 end noncoding regions, intergenic regions, ORFl a, ORFlb, a protease domain, a polymerase 

domain). The identification of such mutations is readily accomplished by exchange of sequences 
with wild-type (e.g., parental) SARS virus and demonstrating transfer of the phenotype, and 
sequencing of the appropriate genome region. Similar mutations that reduce or eliminate 
cytopathogenicity also may be utilized in the context of a SARS virus derived replicon vector, 
35 either by similar selection directly using a SARS virus replicon or by specific engineering of the 
repUcon based on mutation(s) identified in tiie context of infectious SARS virus as described 
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above. In addition, such mutations may serve as tiie basis for attenuated S ARS virus derivatives, 
as described elsewhere in this document. 

Alternatively, rather tiian using infectious SARS virus or its derivatives for cell-based 
screening assays, propagation defective "replicons" may be engineered and utilized. Such 
replicons maintain all protein encoding sequences and cis replication sequences required for 
RNA replication and expression within a cell, but are deleted of one or more sequences or genes 
required for packaging of progeny SARS virus (see for example Curtis et al, J. Virol., 76:1422- 
1434, 2002). Figure 14 is a schematic depicting representative examples of SARS virus 
replicons according to the present invention. For example a SARS virus cDNA constract is 
generated, that is lacking one or more (or all) structural protein encoding genes, whereby the 
missing SARS vuus gene(s) is/are replaced by the GOI, maintaining all necessary transcription 
signals for expression of the GOI. Operably linked with the SARS virus replicon cDNA 
constract is a promoter for RNA polymerase tfiat can be used to transcribe the replicon RNA in 
vivo (e.g., RNA polymerase H promoter) or in vitro (e.g., bacteriophage promoter). The SARS 
replicon may be introduced into a susceptible cell by transfection as RNA or DNA, depending on 
tiie promoter of choice, and the transfected cells may be utilized for the evaluation of antiviral 
compounds. By incorporating one or more mutations rendering the replicon noncytopathic for 
the cells (see above), one can avoid the need for nucleic acid transfection each time an assay is to 
be performed. 

Alternatively, SARS virus replicons may be packaged into virus like particles that allow 
infection of cells, rather than requiring transfection of nucleic acid molecules, A requirement for , 
replicon packaging is that essential SARS virus gene functions deleted from the replicon (e.g., 
one or more stmctural proteins) are provided in trans within the cell containing the replicon. A 
variety of methods for packaging of replicon RNA can be utiMzed to one of skill in the art (see 
for example, Curtis et al., ibid: Ortego, et al, J. Virol., 76:11518-11529, 2002). For example, 
stably transformed cell lines constitutively or inducibly expressing the required SARS virus gene 
functions may be utilized. Alternatively, the required SARS virus gene functions may be 
expressed by viral vectors that are introduced into the replicon-containing cell. Alternatively a 
defective interfering (DI) SARS virus derived RNA containing the required gene functions may 
be introduced into the replicon-containing cell. Such DI constructs used to complement missing 
replicon functions may be more conmionly referred to as defective helper RNA or defective 
helpers. 

Another configuration useful for cell-based antiviral screening assays according to the 
present invention utilizes SARS virus derived DI RNAs encoding a GOI (see for example 
Stirrups, et al, J. Gen. Virol., 81:1687-1698, 2000; Liao, et al. Virology 208:319-327, 1995). 
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Introduction of a SARS DI, either as cDNA linked to an RNA polymerase H promoter or as in 
vitro transcribed RNA. into susceptible cells also infected with SARS virus, allows for a readout 
of the GOI reporter product in assays. 

A replicon-based system for rapid identification of coronavirus repUcase inhibitors is 
described by Hertzig et al. (2004) J Gen Virol DOI 10.1099/vir/0/80044-0. Briefly, the system 
uses a non-cytopathic selectable repUcon RNA that can be stably maintained in eukaryotic cells. 
The replicon RNA mediates reporter gene expression as a marker for coronavirus replication; 
and expression of the reporter can be used to test the inhibitory effect of test compounds in vitro, 
thereby aUowing high throughput screening for repUcase inhibitors without the need to grow 
infectious virus. Preferred repHcon RNAs include a neomycin resistance gene in the repHcase 
gene with a downstream reporter gene {e.g. GFP) that is expressed via replicase-mediated 
synthesis of a sub-genomic mRNA. 

VI. COMPOSITIONS AND METHODS FOR TREATMENT OF SARS VIRUS INFECTION 
The present invention relates to compositions and methods for the treatment and/or prevention of 
SARS. The invention further includes a method for the treatment and/or prevention of SARS : 
through the administration of a therapeuticaUy effective amount of at least one antiviral, 
compound from among those described in the US Patents and published international patent 
applications listed in Table 1 and Table 2. In one embodiment of the method, the antiviral 
compound is a smaU molecule. In another embodiment, the antiviral compound is a protease 
inhibitor. In a further embodiment, the antiviral protease inhibitor is a 3C-like protease inhibitor 
and/or a papain-Uke protease inhibitor. Combined treatment with the lopinavir/ritonavir 
(Kaletra) protease inhibitor and ribavirin has shown a favorable clinical response (Chu et al. 
(2004) Thorax 59:252-256). In another embodiment, the antiviral compound is an inhibitor of an 
RNA dependent RNA polymerase. In another embodiment, a first antiviral compound that is a 
protease inhibitor is administered with a second antiviral compound that is an RNA-dependent 
RNA polymerase inhibitor. The invention further provides for the administration of a steroidal 
anti-inflammatory drug in combination with at least one antiviral compound, for example, ftom 
the antiviral compounds described in the documents listed in Table 1 and Table 2. A 
combination treatment of steroids and ribavirin has been described by Fujii et al. (2004) J Infect 
CJiemother 10:1-7. A combination treatment of corticosteroids and interferon alfacon-1 has also 
been reported (Loutfy et al (2003) JAMA 290:3222-3228). 

The invention further provides for a method for the treatment and/or prevention of SARS 
through the administration of a therapeutically effective amount of at least one antiviral 
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compound from among those described in the US Patents and published international patent 
applications listed in Table 1 and Table 2 by inhalation. In another aspect, the antiviral 
compound may be administered in combination with a SMIP, SMIS , or other 
immunomodulatory compound such as those in Table 34 and in Table 35. In one embodiment of 
the method, the antiviral compound is a small molecule. In another embodiment, the antiviral 
compound is a protease inhibitor. In a further embodiment, the antiviral protease inhibitor is a 
3C-like protease inhibitor and/or a papain-like protease inhibitor. In another embodiment, the 
antiviral compound is an inhibitor of an RNA dependent RNA polymerase. In another 
embodiment, a first antiviral compound that is a protease inhibitor is administered with a second 
antiviral compound that is an RNA-dependent RNA polymerase inhibitor. The invention further 
provides for the administration of a steroidal anti-inflanmiatory drug in combination with at least 
one antiviral compound, for example, from the antiviral compounds described in the documents 
listed in Table 1 and Table 2. The steroidal anti-inflammatory drug may be administered by 
inhalation for a local effect or administered for systemic absorption such as via an oral or 
intravenous route. 

The invention further provides for methods for treating SARS infection comprising 
administering a small molecule immunopotentiator (SMIP) compound either alone or in 
combination with an antiviral compound or in combination with a SARS vaccine. In a further 
embodiment, the SMIP is a compound disclosed herein or set forth in Table 34. 

The invention further provides for methods for treating SARS infection comprising 
administering an immunosuppressant compound, optionally a small molecule suppressant 
(SMIS) compound either alone or in coinbination with an antiviral compound. In a further 
embodiment, the immunosuppressant compound is disclosed herein or set forth in Table 35. 

The invention further provides peptidic inmiunomodulating compoisitions, that include 
oligo and polypeptides, capable of effecting inflammatory response in a patient. In one 
embodiment, the peptidic immunomodulating composition is able to stimulate human cells to 
produce cytokines. In another embodiment the peptidic immunomodulating composition is 
capable of decreasing cytokine levels in the human. Preferred Examples of peptidic 
immunomodulating compositions include those listed in Table 35, as well as TGFP2, TGppi, 
TGFp3, thymopentin (TPS), p-mercaptopropionyl-ar^nyl-lysyl-aspartyl-valyl-tyrosyl-cysteine 
amide, colostiinine, lactoferrin (LP), cyclolinopeptide A (CLA), and tuftsin (TKPR). The 
peptidic immunomodulating compositions of the invention may be used alone or in combination 
with other agents, preferably antiviral compounds, for the treatment of SARS. 
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The invention further provides for a kit for use by a consumer for the treatment and/or 
prevention of SARS. Such a kit comprises: a) a pharmaceutical composition comprising a 
therapeutically effective amount of at least one antiviral, SMIP, SMIS, or other 
unmunomodulating compound from among those described in the US Patents and pubUshed 
intemational patent applications listed in Table 1 , Table 2, Table 34 and Table 35 and a 
pharmaceutically acceptable carrier, vehicle or diluent; b) a container for holding the 
pharmaceutical composition; and, optionally, c) instiiictions describing a method of using tiie 
pharmaceutical compositions for tiie tireatinent and or the prevention of SARS. The kit may 
optionally contain a plurality of compounds for tiie treatinent of SARS wherein tiie antiviral 
compounds are selected from 3C-like protease inhibitors and papain-like protease inhibitors. In 
a further embodiment, tiie kit contains an antiviral compound that is an RNA-dependent RNA 
polymerase inhibitor. When tiie kit comprises more tiian one antiviral, SMIP, SMIS, or otiier 
immunomodulating compound, the compounds contained in the kit may be optionaUy combined 
in the same pharmaceutical composition. 

An additional aspect of tiie invention provides for tiie use of at least one of tiie antiviral, 
SMIP, SMIS, or otiier immunomodulating compounds described in tiie US Patents and published 
intemational patent applications listed in Table 1, Table 2. Table 34 and Table 35 for tiie 
manufacture of a medicament for the tireatinent or prevention of SARS. 

An additional aspect of tiie invention provides for tiie use of at least one SMIP compound, 
or at least one immunosuppressant compound, or at least one SMIS compound for tiie 
manufactiire of a medicament for tiie treatinent or prevention of SARS. Preferred SMIP, 
immunosuppressant, and SMIS compounds are described herein. 

Unless otherwise specified, the following terms, when used witiiin Section VI: 
"Compositions and Methods for Treatment of SARS Virus Infection" of tiie present application 
have the meanings as defined below: 

As used herein, "limit", "treat" and "treatment" are interchangeable terms as are "limiting" 
and "tieating" and, as used herein, include preventative (e.g., prophylactic) and palliative 
tieatinent or the act of providing preventative or palliative tteatinent. The terms include a 
postponement of development of SARS symptoms and/or a reduction in tiie severity of such 
symptoms tiiat wiU or are expected to develop following infection witii a SARS virus. The terms 
further include ameliorating existing SARS symptoms, preventing additional symptoms, 
ameliorating or preventing the underiying metabolic causes of symptoms. 

Representative uses of tiie compositions and methods of tiie present invention include: tiie 
elimination or reduction of the viral load of the SARS virus in a vertebrate, including humans, 
tiie elimination or reduction of symptoms associated witii SARS, and a reduction in morbidity' 
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associated with SARS. In a SARS patient population, the use of the compositions and methods 
of the invention will result in the reduction in the high mortality rates associated with SARS. 

Infection with the SARS virus and the symptoms associated with SARS can be treated in a 
subject by administering the compositions of the invention. The compositions of the invention 
may be administered systemically. For systemic use, the compounds herein are formulated for 
parenteral (e.g., intravenous, subcutaneous, intramuscular, intraperitoneal, intranasal or 
transdermal) or enteral {e.g., oral or rectal) delivery according to conventional metiiods. 
Intravenous administration can be by a series of injections or by continuous infusion over an 
extended period. Administration by injection or other routes of discretely spaced administration 
can be performed at intervals ranging from weekly to once to three times daily or more. 
Alternatively; the compositions disclosed herein may be administered in a cyclical manner 
(administration of disclosed composition, followed by no administration, followed by 
administration of disclosed compositions, and the like). Treatment will continue until the desired 
outcome is achieved. 

: A "subject" is a vertebrate animal including a human that is in need of treatment with the 
compositions, inethods and kits of the present invention. The term "subject" or "subjects" is 
intended to refer to both the male and female gender unless one gender is specifically indicated. 

"Coadministration" of a combination of a plurality of antiviral compounds means that these 
components can be administered together as a composition or as part of the same, unitary dosage 
form "Co-administration" also includes administering a plurality of antiviral compounds 
separately but as part of the same therapeutic treatment program or regimen. "Co-administration" 
also includes administering a plurality of other agents, such as, for example an oligopeptide, a 
polypeptide, a peptidic immunomodulator, nucleic acid, antibodies, or a vaccine wherein the 
compounds or agents are administered separately but as part of the same therapeutic treatment • 
program or regimen. The components need not necessarily be administered at essentially the 
same time, although they can if so desired. "Co-administration" also includes separate 
administration at different times and in any order. For example, where appropriate a patient may 
take one or more component(s) of the treatment in the morning and the one or more of the other 
component(s) at night. 

By "antiviral compound" as used herein is meant an antiviral compound as described in the 
US Patents and published international patent applications listed in Table 1 and Table 2. The US 
Patents and published international patent applications listed in Table 1, Table 2 and Table 35 
are incorporated herein in their entirety. In one embodiment, the antiviral compound is an RNA- 
dependent RNA polymerase. In another preferred embodiment the antiviral compound is a 3C- 
like protease inhibitor or a papain-like protease inhibitor. The antiviral compounds may be 
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administered in the form of the acid, or of a soluble alkali metal salt or alkaline earth metal salt 
where appropriate. 

The precise dosage of the antiviral compound will vary with the dosing schedule, the oral 
potency of the particular antiviral compound chosen, the age, size, sex and condition of the 
subject, the severity of the disorder to be treated, and other relevant medical and physical factors. 
Thus, a precise pharmaceutically effective amount cannot be specified in advance and can be 
readily determined by the caregiver or clinician. 

Generally, an appropriate amount of antiviral compound is chosen to obtain a reduction in 
the load of the SARS virus in the subject and/or to obtain a reduction in the symptoms associated 
with SARS. For humans, an effective oral dose of antiviral compound is typically from about 
1.5 to about 6000 fig/kg body weight per day and preferably about 10 to about 2000 fig/kg of 
body weight per day. 

One of ordinary skill in the art will recognize that certain antiviral, SMIP, SMIS, and 
immunomodulating compounds of the invention including 3C-like protease inhibitors, papain- 
like protease inhibitators, and RNA-dependent RNA polymerase inhibitors will contain one or 
more atoms that may be in a particular stereochemical, tautomeric, or geometric configuration, 
giving rise to stereoisomers, tautomers and configurational isomers. All such isomeiB and 
mixtures thereof are included in this invention, when active. Crystalline and amorphous forms of 
the antiviral compounds of this invention are also included as are hydrates, solvates, polymorphs, 
and isomorphs of the antiviral compounds of the invention. 

SMIP compounds of the invention include compounds are described in issued U.S. Patent 
Nps. 4,547,51 1 and 4,738,971 with the general structure (a): 

R" O 

^.^Heterocyclyl 
I 

R"' 

IN V.^ 

H 

a 

for the treatment of disorders responsive to agents that enhance cell-mediated unmunity. 

Lnmunostimulatory oligonucleotides and polynucleotides are described in PCT WO 
98/55495 and PCT WO 98/16247. U.S. Patent Application No. 2002/0164341 describes 
adjuvants including an unmethylated CpG dinucleotide (CpG ODN) and a non-nucleic acid 
adjuvant. U.S. Patent AppUcation No. 2002/0197269 describes compositions comprising an 
antigen, an antigenic CpG-ODN and a polycationic polymer. 
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Additionally, issued U.S. Patent Nos. 4,689,338, 5,389,640, 5,268,376, 4,929,624, 
5,266,575, 5,352,784, 5,494,916, 5,482,936, 5,346,905, 5,395,937, 5,238,944. 5,525,612. 
W099/29693 and U.S. Ser. No. 09/361,544 disclose compounds of the general stracture (b): 




N 
b 

for the use as "immune response modifiers." 

Further compounds with SMIP and antiviral activity are described below and in US 
Patent Application entitled Thiosemicarbazones as Anti-Virals and Inmiunbpotentiators filed on . 
December 29, 2003 with an attorney docket number of PP19814.004US generally disclosing 
compounds of the following structures: 
A compound of formula c: 

Y" I I I 
R' R" Z' 

c 

wherein: E is absent or selected from the group consisting of alkyl, substituted alkyl, 
cycloalkyl, substituted cycloalkyl, aryl, substituted aryl, heterocyclyl, substituted heterocyclyl, 
heteroaryl, and substituted heteroaryl; 

L is absent or is selected from the group consisting of oxo, aminos alkylene, substituted 
alkylene, alkoxy, alkylamino, aminoalkyl, heterocyclyl, carbocyclyl, and carbonyl; 

W is absent or selected from the group consisting of cycloalkyl, substituted cycloalkyl, 
aryl, substituted aryl, heterocyclyl, substituted heterocyclyl, heteroaryl, and substituted 
heteroaryl; 

X is absent oris selected from the group consisting of oxo, amino, alkylene, substituted 
alkylene, alkoxy, alkylamino, aminoalkyl, heterocyclyl, carbocyclyl, and carbonyl; 

Y is selected from the group consisting of cycloalkyl, substituted cycloalkyl, aryl, 
substituted aryl, heterocyclyl, substituted heterocyclyl, heteroaryl, and substituted heteroaryl; 
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Y' is absent or is selected from the group consisting of F, CI, Br, I, nitro, alkyl, 
substituted alkyl, and optionally substituted heterocyclyl, amino, alkylamino, dialkylamino; 
Y" is absent or is selected from the group consisting of F, CI, Br, I, nitro, alkyl, substituted alkyl, 
and optionally substituted heterocyclyl, amino, alkylamino, dialkylamino; 
R' is H, alkyl, or substituted alkyl; 
R" is H, or 

R' and R' ' are taken together to form a hetercyclic ring; 

Z and Z' are independently selected from the group consisting of hydrogen, alkyl, substituted 
alkyl, aryl, substituted aryl, arylalkyl, substituted arylalkyl, heteroaryl, substituted heteroaryl, 
heteroarylalkyl, substituted heteroarylalkyl, alkoxy, substituted alkoxy, aminocarbonyl, 
alkoxycarbonyl, carboxyl sulfonyl, methanesulfonyl, and substimted or unsubstituted 
alkylcarbonyl, arylcarbonyl, aralkylcarbonyl, heteroarylcarbonyl, heteroaralkylcarbonyl, 
alkylcarbonyloxy, arylcarbonyloxy, aralkylcarbonyl oxy, heteroarylcarbonyloxy, 
heteroaralkylcarbonyloxy, alkylaminocarbonyloxy, arylaminocarbonyloxy, formyl, 
loweralkylcarbonyl, loweralkoxycarbonyl, aminocarbonyl, aminoaryl, alkylsulfonyl, 
sulfonamido, aminoalkoxy, alkylamino, heteroarylamino, alkylcarbonylamino, 
alkylaminocarbonylamino, arylaminocarbonylamino, aralkylcarbonylamino, 
heteroarylcarbonylamino, arylcarbonylamino, cycloamidino, cycloalkyl, cycloimido, 
arylsulfonyl and arylsulfonamido; or 

Z and Z' are taken together to form a heterocyclic group, that may be optionally substituted and 
the tautomers and the pharmaceutically acceptable salts». esters, or prodrugs thereof. 

Further SMTP compounds are described below and in US Patent Application 10/762873, 
Use of Tryptanthrin Compounds for Immune Potentiation, filed on January 21 , 2004 and 
disclosing the general embodiment of compounds represented by Formula (d): 



wherein 

A, B, C, D, E, F, G, and H are independently selected from carbon and nitrogen, or A and 
B and/or C and D can be taken together to be nitrogen or sulfur; 
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Rl, R2, R3, R4. Rg, and Rjo are independently selected from the group consisting of hydrogen, 
halogen, loweralkyl, alkyl, substituted alkyl, cycloalkyl, heterocyclyl, alkylheterocyclyl, 
substituted heterocyclyl, substituted alkenyl, amino, (substituted alkyl)(alkyl)amino. imino, 
haloloweralkyl, hydroxy, alkoxy. substituted alkoxy. hydroxyalkylthio. nitro, alkylsulfonyl, N- 
alkylsulfonamide, arylalkyl, arylalkylaryl, arylaryl, aryloxy, arylamino, acylamino, 
acyloxyamino, alkylaminoacylamino, alkylaminosulfonylamino, alkylamino, alkenylamino, 
dialkylamino, alkoxyalkylamino, alkoxyalkylheterocyclyl, mercaptoalkoxyalkyl, cyano, - 
formyl, -COORn wherein Rn is hydrogen, loweralkyl, aryl, heterocyclyl, monosaccharide or 
disaccharide. and -CONRjjRis wherein R12 and R13 are mdependently selected from hydrogen, 
loweralkyl, aryl, heterocyclyl, saccharide, peptide and amino acid residues; or R2 and R3 taken 
together form a six membered aromatic ring; 

R7 and R9 are independently selected from hydrogen, halogen, loweralkyl, 
haloloweralkyl, cycloalkyl, heterocyclyl, substituted heterocyclyl or heterocyclylalkyl; and 

Rl, R2, R3, R4. R7, Rs, R9,and R^q are absent when the ring atom to which they would 
otherwise be bonded is sulfur or double-bonded nitrogen; or 

the a pharmaceutically acceptable salts, esters, or prodrugs thereof, 
provided that R,, R2, R3, R4, R7, Rs, R9, and R,o are not all hydrogen when A, B, C, E, F. and 
Hare carbon. 

In one embodiment, the compounds of Formula (I) have a backbone structure wherein D is 
nitrogen, and A-C and E-H are carbon. 

In one embodiment, when D is carbon, at least one, or at least two of Rj - R4^ and R7 - Rjq are 
not hydrogen. 

In one embodiment, Rj through R4^ and Rg and Rjq are independently selected from at least two 
of the group consisting of hydrogen, halogen, loweralkyl, cycloalkyl, heterocyclyl, substituted 
heterocyclyl, alkylheterocyclyl, amino, imino, haloloweralkyl, alkoxy, nitro, alkylsulfonyl, 
arylalkyl, arylalkylaryl, arylaryl, aryloxy, arylamino, acylamino, acyloxyamino, 
alkylaminoacylamino, alkylaminosulfonylamino, alkylamino, alkenylamino, dialkylamino, 
alkoxyalkylamino, alkoxyalkylheterocyclyl, mercaptoalkoxyalkyl, cyano, formyl, -COORu 
where Rjj is hydrogen, loweralkyl. aryl, heterocyclyl, monosaccharide or disaccharide, 
and -CONRi2Ri3 where Rjj and Rjj are independently selected from hydrogen, loweralkyl, 
aryl, heterocyclyl, saccharide, peptide and amino acid residues; and R4 is not present when D is 
nitrogen. 
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In an additional embodiment, 4A, B, C. D, E, F, G. and H are independently selected from 
carbon and nitrogen; 

Rl, R2, R3, R4, Rg and Rjo are independenUy selected from the group consisting of 
hydrogen, halogen, loweralkyl, alkyl, substituted alkyl, heterocyclyl, substituted heterocyclyl, 
substituted alkenyl. (substituted alkyl)(alkyl)amino, haloloweralkyl, hydroxy, alkoxy, substituted 
alkoxy, hydroxyalkylthio, nitro, N-alkylsulf on amide, cyano, -COORji wherein Ru is hydrogen, 
loweralkyl, aryl, heterocyclyl, monosaccharide or disaccharide, and -CONR12R13 wherein R12 
and Ri3 are independenfly selected from hydrogen, loweralkyl, aryl, heterocyclyl. saccharide, 
peptide and amino acid residues. 

For the compounds described herein: ' 

The term "loweralkyl" refers to branched or sti^ght chain acyclical alkyl groups 
comprising pne to ten carbon atoms, including, e.g., methyl, ethyl, propyl, isopropyl, n.butyl, 
t-butyl, neopentyl and the like. 

The term "alkyl" refers to alkyl groups that do not contain heteroatoms. Thus the term . 
includes straight chain alkyl groups such as methyl, ethyl, propyl, butyl, pentyl. hexyl, heptyl, 
octyl, nonyl, decyl, undecyl, dodecyl and the Uke. The phrase also includes branched chain • 
isomers of straight chain alkyl groups, including but not limited to. the foUowing tiiat are 
provided by way of example: -CH(CH3)2, -CH(CH3)(CH2CH3), -CH(CH2CH3)2. - 
C(CH3)3, -C(CH2CH3)3, -CH2CH(CH3)2, -CH2CH(CH3)(CH2CH3). - 
CH2CH(CH2CH3)2. -CH2C(CH3)3, -CH2C(CH2CH3)3, - 

CH(CH3)CH(CH3)(CH2CH3), -CH2CH2CH(CH3)2. -CH2CH2CH(CH3)(CH2CH3), - 
CH2CH2CH(CH2CH3)2. -CH2CH2C(CH3)3, -CH2CH2C(CH2CH3)3, - 
CH(CH3)CH2CH(CH3)2. -CH(CH3)CH(CH3)CH(CH3)2. - 

CH(CH2CH3)CH(CH3)CH(CH3)(CH2CH3). and others. The term also includes cyclic alkyl 
groups such as cyclopropyl, cyclobutyl, cyclopentyl, cyclohexyl, cycloheptyl, and cyclooctyl and 
such rings substituted with sti^ght and branched chain allq^l groups as defined above. The term 
also includes polycyclic alkyl groups such as, but not limited to. adamantyl norbomyl. and 
bicyclo[2.2.2]octyl and such rings substituted with straight and branched chain alkyl groups as 
defined above. Thus, the phrase unsubstituted alkyl groups includes primary alkyl groups, 
secondary alkyl groups, and tertiary alkyl groups. Unsubstituted alkyl groups may be bonded to 
one or more carbon atom(s), oxygen atom(s), niti-ogen atom(s), and/or sulfur atom(s) in tiie 
parent compound. Preferred unsubstitiited alkyl groups include straight and branched chain alkyl 
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groups and cyclic alkyl groups having 1 to 20 carbon atoms. More prefened such unsubstituted 
alkyl groups have from 1 to 10 carbon atoms while even more preferred such groups have from 1 
to 5 carbon atoms. Most preferred unsubstituted alkyl groups include straight and branched 
chain alkyl groups having from 1 to 3 carbon atoms and include methyl, ethyl, propyl, and ~ 
CH(CH3)2. 

The phrase "substituted alkyl" refers to an unsubstituted alkyl group as defined above in 
which one or more bonds to a carbon(s) or hydrogen(s) are replaced by a bond to non-hydrogen 
and non-carbon atoms such as, but not limited to, a halogen atom in halides such as F, CI, Br, 
and I; a phosphorus atom in groups such as phosphate and dialkyl alkylphosphonate; oxygen 
atom in groups such as hydroxyl groups, alkoxy groups, aryloxy groups, and ester groups; a 
sulfur atom in groups such as thiol groups, alkyl and aryl sulfide groups, sulfone groups, sulfonyl 
groups, and sulfoxide groups; a nitrogen atom in groups such as aminesi amides, alkylamines, 
dialkylamines, arylamines, alkylarylamines, diarylamines, N-oxides, imides, and enamines; a 
silicon atom in groups such as in trialkylsilyl groups, dialkylarylsilyl groups, alkyldiarylsilyl 
groups, and triarylsilyl groups; and other heteroatoms in various other groups. Substituted alkyl 
groups also include groups in which one or more bonds to a carbon(s) or hydrogen(s) atom is 
replaced by a bond to a heteroatom such as oxygen in carbonyl, carboxyl, and ester groups; 
nitrogen in groups such as imines, oximes, hydrazones, and nitriles. Preferred substituted alkyl 
groups include, among others, alkyl groups in which one or more bonds to a carbon or hydrogen 
atom is/are replaced by one or more bonds to fluorine atoms. One example of a substituted alkyl ; 
group is the trifluoromethyl group and other alkyl groups that contain the trifluoromethyl group. 
Other alkyl groups include those in which one or more bonds to a carbon or hydrogen atom is 
replaced by a bond to an oxygen atom such that the substituted alkyl group contains a hydroxyl, 
alkoxy, aryloxy group, or hetercx:yclyloxy group. Still other alkyl groups include alkyl groups 
that have an amine, alkylamine, dialkylamine, arylamine, (alkyl)(aryl)amine,diarylamine, 
heterocyclylamine, (alkyl)(het6rocyclyl)amine, (aryl)(heterocyclyl)amine, or 
diheterocyclylamine group. 

The term "alkoxy" refers to RO- wherein R, for example, is alkyl, such as loweralkyl 
defined above. Representative examples of loweralkyl alkoxy groups include methoxy, ethoxy, 
t-butoxy and the like. 

The phrase "substituted alkoxy" refers to RO-, where R is, for example, an alkyl 
substituted, for example, with a halogen. RO is for example OCF3. 
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The term "alkenyl" refers to a branched or straight chain groups comprising two to 
twenty carbon atoms that also comprises one or more carbon-carbon double bonds. 
Representative alkenyl groups include prenyl, 2-propenyl (i.e., ally]), 3-methyl-2-butenyl, 
3,7-dimethyl-2,6-octadienyI, 4,8-dimethyl-3,7-nonadienyl, 3,7,1 l-trimethyl-2,6,10-dodecatrienyl 
and the like. 

The phrase "substituted alkenyl" refers to alkenyl groups that are substituted, for 
example, diethyl hex-5-enylphosponate, and others with an alkyl or substituted alkyl group such 
as dialkyl phosphate or an ester such as an acetate ester. 

• The phrase "dialkyl amino" refers to an amino group substituted with two alkyl groups 
such as Cl-20 alkyl groups. 

The phrase "substituted dialkyl amino" refers to a dialkylamino substituted, for example, 
with a carboxylic acid, ester, hydroxy or alkoxy. 

The term "hydroxyalkylthio" refers Jo a thio radical to which is appended a hydroxyalkyl 
group, where the alkyl is for example lower alkyl. An example is hydroxyethylthio, - 
SCH2CH2OH. 

The term "N-alkylsulfonamide" refers to the group -SOzNHalkyl, where alkyl is, for 
example, octyl. 

The term "alkynyl" refers to a branched or straight chain comprising two to twenty 
carbon atoms that also comprises one or more carbon-carbon triple bonds. Representative 
alkynyl groups include ethynyl, 2-propynyl (propargyl), l-propynyl and the like. 

The term "aryl" refers to aryl groups that do not contain heteroatoms. Thus the tenn 
includes, but is not limited to, groups such as phenyl, biphenyl, anthracenyl. naphthenyl by way 
of example. Although the phrase "unsubstituted aryl" includes groups containing condensed 
rings such as naphthalene, it does not include aryl groups that have other groups such as alkyl or 
halo groups bonded to one of the ring members, as aryl groups such as tolyl are considered 
herein to be substituted aryl groups as described below. A preferred unsubstituted aryl group is 
phenyl. Unsubstituted aryl groups may be bonded to one or more carbon atom(s), oxygen 
atom(s), nitrogen atom(s), and/or sulfur atom(s) in the parent compound, however. 

The phrase "substituted aryl group" has the same meaning with respect to aryl groups that 
substituted alkyl groups had with respect to alkyl groups. However, a substituted aryl group also 
includes aryl groups in which one of the aromatic carbons is bonded to one of the non-carbon or 
non-hydrogen atoms described above and also includes aryl groups in which one or more 
aromatic carbons of the aryl group is bonded to a substituted and/or unsubstituted alkyl, alkenyl, 
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or alkynyl group as defined herein. This includes bonding arrangements in which two carbon 
atoms of an aryl group are bonded to two atoms of an alkyl, alkenyl, or alkynyl group to define a 
fused ring system (e.g. dihydronaphthyl or tetrahydronaphthyl). Thus, the phrase "substituted 
aryl" includes, but is not limited to tolyl, and hydroxyphenyl among others. 

The term "arylalkyl" refers to a loweralkyi radical to which is appended an aryl group. 
Representative arylalkyl groups include benzyl, phenylethyl, hydroxybenzyl, fluorobenzyl, 
fluorophenylethyl and the like. 

The phrase "unfused arylaryl" refers to a group or substittient to which two aryl groups, 
that are not condensed to each other, are bound. Exemplary unfused arylaryl compounds include, 
for example, phenylbenzene, diphenyldiazene, 4-methylthio-l-phenylbenzene, phenoxybenzene, 
(2-phenylethynyl)benzene, diphenyl ketone, (4-phenylbuta-l,3-diynyl)benzene, 
phenylbenzylamine, (phenylmethoxy)ben2ene, and the like. Preferred substimted unfused 
arylaryl groups include: 2-(phenylamino)-N-[4-(2-phenylethynyl)phenyl]acetamide. 1.4- 
diphenylbenzene, N-[4-(2-phenylethynyl)phenyl]-2-[berizylamino]acetahiide, 2-amino-N-[4-(2- 
phenylethynyl)phenyl]propanamide, 2-amino-N-[4-(2-phenylethynyl)phenyl]acetamide, 2- 
(cyclopropylamino)-N-[4-(2-phenylethynyl)phenyl]acetamide, 2-(ethylamino)-N-[4-(2- 
phenylethynyl)phenyl]acetamide,2-[(2-methylpropyl)amino]-N-[4-(2- 

phenylethynyl)phenyl]acetamide, 5-phenyl-2H-benzo[d] 1 ,3-dioxolene, 2-chloro-l-methoxy-4- 
phenylbenzene, 2-[(imidazolylmethyl)amino]-N-[4-(2-phenylethynyl)phenyl]acetamide, 4- 
phenyl-l-phenoxybenzene, N-(2-aminoethyl)[4-(2-phenylethynyl)phenyl]carboxamide, 2-{ [(4- ■ • 
fluorophenyl)methyl]amino}-N-[4-(2-phenylethynyl)phenyl3acetamide, 2-{[(4- 
methylphenyl)methyl]amino }-N-[4-(2-phenylethynyl)phenyl]acetamide, 4-phenyl-l- 
(trifluoromethyl)benzene, l-butyl-4-phenylbenzene, 2-(cyclohexylaniino)-N-[4-(2- 
phenylethynyl)phenyl]acetamide, 2-(ethylmethylamino)-N-[4-(2- 

phenylethynyl)phenyl]acetamide, 2-(butylamino)-N-[4-(2-phenylethynyl)phenyl]acetamide. N- 
[4-(2-phenylethynyl)phenyl]-2-(4-pyridylamino)acetamide,N-[4-(2-pheriylethynyl)phenyl]-2- 
(quinuchdin-3-ylamino)acetamide,N-[4-(2-phenylethynyl)phenyl]pyrrolidin-2-ylcarboxamide. 
2-amino-3-methyl-N-[4-(2-phenylethynyl)phenyl]butanamide, 4-(4-phenylbuta-l ,3- 
diynyOphenylamine, 2-(dimethylamino)-N-[4-(4-phenylbuta-l .3-diynyl)phenyl]acetamide, 2- 
(ethylamino)-N-[4-(4-phenylbuta-l,3-diynyl)phenyl]acetamide. 4-ethyl-l -phenylbenzene, l-[4- 
(2-phenylethynyl)phenyl]ethan-l-one,N-(l-carbamoyl-2-hydroxypropyl)[4-(4-phenylbuta-l,3- 
diynyl)phenyl]carboxamide, N-[4-(2-phenylethynyI)phenyl]propanamide, 4-methoxyphenyl 
phenyl ketone, phenyl-N-benzamide, (tBrt-butoxy)-N-[(4-phenylphenyl)methyl]carboxamide, 2- 
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(3-phenylphenoxy)ethanehydroxamic add, 3-phenylphenyl propanoate, l-(4-ethoxyphenyl)-4- 
methoxybenzene, and [4-(2-phenylethynyl)phenyl]pynole. 

The phrase "unfused heteroarylaryl" refers to a unfused arylaryl group where one of the 
aryl groups is a heteroary] group. Exemplary heteroarylaryl groups include, for example, 2- 
phenylpyridine, phenylpyrrole, 3-(2-phenylethynyl)pyridine, phenylpyrazole, 5-(2- 
phenylethynyl)-l,3-dihydropyrimidine-2,4-dione, 4-phenyl-l,2,3-thiadiazole, 2-(2- 
phenylethynyl)pyrazine, 2-phenylthiophene, phenylimidazole, 3-(2-piperazinylphenyl)furan, 3- 
(2,4-dichlorophenyl)-4-methylpyrrole, and the like. Preferred substituted unfused heteroarylaryl 
groups include: 5-(2-phenylethynyl)pyrimidine-2-ylamine, l-methoxy-4-(2-thienyl)benzene, 1- 
methoxy-3-(2-thienyl)benzene, 5-methyl-2-phenylpyridine, 5-methyl-3-phenylisoxazole, 2-[3- 
(trifluoromethyl)phenyl]furan, 3-fluoro-5-(2-furyl)-2-methoxy-l-prop-2-enylbenzene, 
(hydroxyimino)(5-phenyl(2-thienyl))methane, 5-[(4-methylpiperazinyl)methyl]-2- 
phenylthiophene, 2-(4-ethylphenyI)thiophene, 4-methylthio-l-(2-thienyI)benzene, 2-(3- 
nitrophenyl)thiophene, (tert-butoxy)-N-[(5-phenyl(3-pyridyl))methyl]carboxamide, hydroxy-N- 
[(5-phenyl(3-pyridyl))methyl]amide, 2-(phenyhnethylthio)pyridine, and benzylimidazole. 

The phrase "unfused heteroarylheteroaryl" refers to an unfused arylaryl group where both 
of the aryl groups is a heteroaryl group. Exemplary heteroarylheteroaryl groups include, for 
example, 3-pyridylimidazole, 2-imidazolylpyrazine, and the Uke. Preferred substituted unfused 
heteroarylheteroaryl groups include: 2-(4-piperazinyl-3-pyridyl)furan, diethyl(3-pyrazin-2-yl(4- 
pyridyl))amine, and dimethyl{2-[2-(5-methylpyrazin-2-yl)ethynyl](4-pyiidyl)}amine. 

. The phrase "fused arylaryl" refers to an aryl group as previously defined that is 
condensed, and fiilly conjugated to an aryl group. Representative fused arylaryl groups include 
biphenyl, 4-(l-naphthyl)phenyl, 4-(2-naphthyl)phenyl and the like. 

The phrase "fused heteroarylaryl" refers to an aryl group as previously defined that is 
condensed, and fully conjugated with a heteroaryl group. Representative fused heteroarylaryl 
groups include quinoline, quinazoline and the like. 

The phrase "fused heteroarylheteroaryl" refers to a heteroaryl group as previously defined 
that is condensed, and fully conjugated with another heteroaryl group. Representative fused 
heteroarylheteroaryl groups include pyrazalopyrimidine, imidazoquinoline and the like. 

The term "aryloxy" refers to RO- wherein R is an aryl group. Representative arylalkoxy 
group include benzyloxy, phenylethoxy and the like. 

The term "arylalkoxy" refers to a lower alkoxy radical to which is appended an aryl 
group. Representative arylalkoxy group include benzyloxy, phenylethoxy and the like. 
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The term "aryloxyaryl" refers to an aryl radical to which is appended an aryloxy group. 
Representative aryloxyaryl groups include 4-phenoxyphenyl, 3-phenoxyphenyl, 
4-phenoxy-l-naphthyl, 3-phenoxy-l-naphthyl and the like. 

The term "aryloxyarylalkyl" refers to an arylalkyl radical to which is appended an 
aryloxy group. Representative aryloxyarylalkyl groups include 4-phenoxyphenylmethyl, 
3-phenoxyphenyhnethyl, 4-phenoxyphenylethyl, 3-phenoxy-phenylethyl and the like. 

The term "arylalkoxyaryl" refers to an aryl radical to which is appended an arylalkoxy 
group. Representative arylalkoxyaryl groups include 4-benzyloxylphenyl, 3-benzyloxyphenyl 
and the like. 

The term "arylalkoxyarylalkyl" refers to an arylalkyl radical to which is appended an 
arylalkoxy group. Representative arylalkoxyarylalkyl groups include 4^ben2yloxylbenzyl, 
3-benzyloxybenzyl and the like. 

The term "cycloalkyl" refers to an aUcyclic group comprising from 3 to 7 caibon atoms 
including, but not limited to, cyclopropyl, cyclobutyl, cyclopentyl, cyclohexyl and the like. 

The term "cycloalkylalkyi" refers to a loweralkyl radical to which is appended a 
cycloalkyl group. Representative examples of cycloalkylalkyi include cyclopropyhnethyl, 
cyclohexyhnetiiyl, 2-(cycloprppyI)ethyl and the like. 

The term "halogen" refers to iodine, bromine, chlorine or fluorine; "halo" refers to iodo, 
bromo, chloro or fluoro. . . 

The term "haloalkyl" refers to a lower alkyl radical, as defined above, bearing at least one : 
halogen substituent, for example, chloromethyl, fluoroethyl or trifluoromethyl and the like. 

The term "heterocyclyl" (or heterocyclic, or heterocyclo) refers to both aromatic and 
nonaromatic ring compounds including monocyclic, bicyclic. and polycyclic ring compounds 
such as. but not limited to, quinuclidyl, containing 3 or more ring members of which one or more 
is a heteroatom such as. but not Hmited to, N, O, and S. Although the phrase "unsubstimted 
heterocyclyl" includes condensed heterocyclic rings such as benzimidazolyl, it does not include 
heterocyclyl groups that have other groups such as alkyl or halo groups bonded to one of the ring 
members as compounds such as 2-methylbenzimidazolyl are substituted heterocyclyl groups. 
Examples of heterocyclyl groups include, but are not limited to: unsaturated 3 to 8 membered 
rings containing 1 to 4 nitrogen atoms such as, but not limited to pyrrolyl, pyrrolinyl, imidazolyl, 
pyrazolyl, pyridyl, dihydropyridyl, pyrimidyl, pyrazinyl, pyridazinyl, triazolyl (e.g. 4H-1,2,4- 
triazolyl, lH-l,2,3-triazolyl, 2H-l,2,3-triazolyl etc.), tetrazolyl, (e.g. IH-tetrazolyl, 2H 
tetrazolyl, etc.); saturated 3 to 8 membered rings containing 1 to 4 nitrogen atoms such as. but 
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not limited to, pyrrolidinyl, imidazolidinyl, piperidinyl, piperazinyl; condensed unsaturated 
heterocyclic groups containing Ito 4 nitrogen atoms such as, but not limited to, indolyl, 
isoindolyl, indolinyl, indolizinyl, benzimidazolyl, quinolyl, isoquinolyl, indazolyl, 
benzotriazolyl; unsaturated 3 to 8 membered rings containing 1 to 2 oxygen atoms and 1 to 3 
nitrogen atoms such as, but not limited to, oxazolyl, isoxazolyl, oxadiazolyl (e.g. 1,2,4- 
oxadiazolyl, 1,3,4-oxadiazolyl, 1,2,5-oxadiazolyl, etc.); saturated 3 to 8 membered rings 
containing 1 to 2 oxygen atoms and 1 to 3 nitrogen atoms such as, but not limited to, - - 
morpholinyl; unsaturated condensed heterocyclic groups containing 1 to 2 oxygen atoms and 1 to 
3 nitrogen atoms, for example, benzoxazolyl, benzoxadiazolyl, benzoxazinyl (e.g. 2H-1,4- 
benzoxazinyl etc.); unsaturated 3 to 8 membered rings containing 1 to 3 sulfur atoms and 1 to 3 
nitrogen atoms such as, but not limited to, thiazolyl, isothiazolyl, thiadiazolyl (e.g. 1,2,3- 
thiadiazolyl, 1,2,4-thiadiazolyl, 1,3,4-thiadiazolyl, 1,2,5-thiadiazolyl, etc.); saturated 3 to 8 
membered rings containing 1 to 2 sulfur atoms and 1 to 3 nitrogen atoms such as, but not limited 
to, thiazolodinyl; saturated and unsaturated 3 to 8 membered rings containing 1 to 2 sulfur atoms 
such as, but not limited to, thienyl, dihydrodithiinyl, dihydrodithionyl, tetrahydrothiophene, 
tetrahydrothiopyran; unsaturated condensed heterocyclic rings containing 1 to 2 sulfur atoms and 
1 to 3 nitrogen atoms such as, but not limited to, benzothiazolyl, benzothiadiazolyl, 
benzothiazinyl (e.g. 2H-l,4-benzothiazinyl, etc.), dihydrobenzothiazinyl (e.g. 2H-3,4- 
dihydrobenzothiazinyl, etc.), unsaturated 3 to 8 membered rings containing oxygen atoms such 
as, but not Umited to furyl; unsaturated condensed heterocyclic rings containing 1 to 2 oxygen 
atomis such as benzodioxolyl (e.g. 1,3-benzodioxoyl, etc.); unsaturated 3 to 8 membered rings 
containing an oxygen atom and 1 to 2 sulfur atoms such as, but not limited to, dihydrooxathiinyl; 
saturated 3 to 8 membered rings containing 1 to 2 oxygen atoms and 1 to 2 sulfur atoms such as 
1,4-oxathiane; unsaturated condensed rings containing 1 to 2 sulfur atoms such as benzothienyl, 
benzodithiinyl; and unsaturated condensed heterocyclic rings containing an oxygen atom and 1 
to 2 oxygen atoms such as benzoxathiinyl. Heterocyclyl group also include those described 
above in which one or more S atoms in the ring is double-bonded to one or two oxygen atoms 
(sulfoxides and sulfones). For example, heterocyclyl groups include tetrahydrothiophene, 
tetrahydrothiophene oxide, and tetrahydrothiophene 1,1-dioxide. Preferred heterocyclyl groups 
contain 5 or 6 ring members. More preferred heterocyclyl groups include morpholine, 
piperazine, piperidine, pyrrolidine, imidazole, pyrazole, 1,2,3-triazole, 1,2,4-triazole, tetrazole, 
thiomorphoUne, thiomorpholine in which the S atom of the thiomorphoUne is bonded to one or 
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more O atoms, pyrrole, homopiperazine, oxazolidin-2-one, pyrrolidin-2-one, oxazole, 
quinuclidine, thiazole, isoxazole, furan, and tetrahydrofuran. 

The phrase "substituted heterocyclyl" refers to an heterocyclyl group as defined above in 
which one of the ring members is bonded to a non-hydrogen atom such as described above with 
5 respect to substituted alkyl groups and substituted aryl groups. Examples, include, but are not 
limited to, 2-methylbenzimidazolyl, 5-methylbenzimidazolyl, 5-chlorobenzthiazolyl, 1-methyl 
piperazinyl, and 2-chloropyridyl among others. - - 

"Aminosulfonyl" refers to the group -S(0) 2-NH2. "Substituted aminosulfonyl" refersto 
the group -S(0) 2-NRR' where R is loweralkyl and R' is hydrogen or a loweralkyl. The term 
10 "aralkylaminosulfonlyaryl" refers to the group -aryl-S(O) 2-NH-aralkyl, where the aralkyl is 
loweraralkyl. 

"Carbonyl" refers to the divalent group -C(0)-. 

"Carbonyloxy" refers generally to the group -C(0)-0-,. Such groups include 
esters; -C(0)-0-R, where R is loweralkyl, cyclbalkyl, aryl. or loweraralkyl. The term 

15 "carbonyloxycycloalkyl" refers generally to both an "carbonyloxycarbocycloalkyl" and an 
"carbonyloxyheterocycloalkyl", i.e., where R is a carbocycloalkyl or heterocycloalkyl,- 
respectively. The term "arylcarbonyloxy" refers to the group -C(0)-0-aryl, where aryl is a 
mono- or polycyclic, carbocycloaryl or heterocycloaryl. The term "aralkylcarbonyloxy" refers t<J 
the group -C(0)-0-aralkyl, where the aralkyl is loweraralkyl. 

20 r The term "sulfonyl" refers to the group -SO2-. "Alkylsulfonyl" refers to a substituted 

sulfonyl of the structure -SO2R - in which R is alkyl. Alkylsulfonyl groups employed in 
compounds of the present invention are typically loweralkylsulfonyl groups having from 1 to 6 • ■ 
carbon atoms in its backbone strucmre. Thusj typical alkylsulfonyl groups employed in 
compounds of the present invention include, for example, methylsulfonyl (i.e., where R is 

25 methyl), ethylsulfonyl (i.e., where R is ethyl), propylsulfonyl (i.e where R is propyl), and the 
like.. The term "arylsulfonyl" refersto the group -SOa-aryl. The term "aralkylsulfonyl" refers to 
, the group -S02-aralkyl, in which the aralkyl is loweraralkyl. The term "sulfonamido" refers to - 
SO2NH2. 

The term "carbonylamino" refers to the divalent group -NH-C(O)- in which the hydrogen 
30 atom of the amide nitrogen of the carbonylamino group can be replaced a loweralkyl, aryl, or 

loweraralkyl group. Such groups include moieties such as carbamate esters (-NH-C(O)-O-R) and 
amides -NH-C(0)-0-R, where R is a straight or branched chain loweralkyl, cycloalkyl, or aryl 
or loweraralkyl. The term "loweralkylcarbonylamino" refers to alkylcarbonylamino where R is a 
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loweralkyi having from 1 to about 6 carbon atoms in its backbone structure. The term 
"arylcarbonylamino" refers to group -NH-C(0)-R where R is an aryl. Similarly, the tenn 
"aralkylcarbonylamino " refers to carbonylamino where R is a lower aralkyl. 

The term "guanidino" or "guanidyl" refers to moieties derived from guanidine, 
H2N-C(=NH)-NH2. Such moieties include those bonded at the nitrogen atom carrying the fonnal 
double bond (the "2"-position of the guanidine, e.g., diaminomethyleneamino, (H2N)2C=NH-) 
and those bonded at either of the nitrogen atoms carrying a formal single bond (the "1-" and/or 
"3"-positions of the guandine, e.g., H2N-C(=NH)-NH-.). The hydrogen atoms at any of the 
nitrogens can be replaced with a suitable substituent, such as loweralkyi, aryl, or loweraralkyl. 

Representative cycloimido and heterocycloimido groups include, for example, those 
shown below. These cycloimido and heterocycloimido can be further substituted and may be 
attached at various positions as will be apparent to those having skill in the organic and 
medicinal chemistry arts in conjunction with the disclosure herein. 




Representative substituted amidino and heterocycloamidino groups include, for example, 
those shown below. These amidino and heterocycloamidino groups can be further substituted as 
will be apparent to those having skill in the organic and medicinal chemistry arts in conjunction 
with the disclosure herein. 
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.5 



and K . 



Representative substituted alkylcarbonylamino, alkyloxycarbonylamino, 
aminoalkyloxycarbonylamino, and arylcarbonylamino groups include, for example, those shown 
below. These groups can be further substituted as will be apparent to those having skill in the - 
organic and medicinal chemistry arts in conjunction with the disclosure herein. 

\ \ \ \ 0"^^^^^° 



IN^O 



and OH 



10 Representative substituted aminocarbonyl groups include, for example, those shown 

below. These can heterocyclo groups be further substituted as will be apparent to those having 
skill in the organic and medicinal chemistry arts in conjunction with the disclosure herein. 

Y° ^° T 



NHz 



.J. HN^ 



Representative substituted alkoxycarbonyl groups include, for example, those shown 
below. These alkoxycarbonyl groups can be further substituted as will be apparent to those 
having skill in the organic and medicinal chemistry arts in conjunction with the disclosure 
X) herein. 
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5 "Substituted" refers to the dejHnite replacement of hydrogen with one or more 

monovalent or divalent radicals. Suitable substitution groups include, those described herein for 
particular groups, as well as hydroxyl, nitro, amino, imino, cyano, halo, thio, thioamido,. 
amidino, imidino, oxo, oxamidino, methoxamidino, imidino, guanidino, sulfonamide, carboxyl, 
formyl, alkyl, substituted alkyi, haloloweralkyl, loweralkoxy, haloloweralkoxy, loweralkoxy- 
10 alkyl, alkylcarbonyl, arylcarbonyl, aralkylcarbonyl. heteroarylcarbonyl, heteroaralkylcarbonyl, -. 
alkylthio, aminoalkyl, cyanoalkyl, benzyl, pyridyl. pyrazolyl, pyrrole, thiophene, imidazblyl. and 
the like. 

The term "linking moiety" refers to a covalent bond or an uncyclized divalent group, 
such as, for example, -CO-, -0-. -S-, -CH2-, -NH-, and substituted or unsubstituted alkyl, 

15 alkenyl, alkynyl, carbonyl, alkoxycarbonyl groups as defined herein. 

The term "SMIP compound" refers to small molecule immunopotentiating compounds, 
that include small molecule compounds below about MW 1000 g/mol, preferably MW 800 
g/mol that are capable of stimulating or modulating a pro-inflammatory response in a patient. In 
an embodiment, the SMIP compounds are able to stimulate human peripheral blood mononuclear 

10 ceUs to produce cytokines. Preferred SMBP compounds and derivatives thereof include, for 
example, aminoazavinyl compounds, benzazole compounds, acylpiperazine compoundsi 
indoledione compounds, tetrahydroisoquinoline (THIQ) compounds, anthraquinone compounds, 
indanedione compounds, pthalimide compounds, benzocyclodione compounds, 
aminobenzimidazole quinoHnone (ABIQ) compounds, hydraphthalimide compounds, 

15 pyrazolopyrimidine compounds, quinazilinone compounds, quinoxaline compounds, triazine 
compounds, tetrahydropyrrolidinoquinoxaline compounds, pyrrole compounds, benzophenone 
compounds, sterol compound, and isoxazole compounds. 
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The term "SMIS compound" refeis to small molecule immunosuppressant compounds, 
that include small molecule coinpounds below about about MW 1000 g/mol, preferably MW 800 
g/mol, capable of suppressing or modulating a pro-inflanamatory response in a patient. 

Acylpiperazine compounds as described throughout this application include compounds 
of formula (HI) as shown below: 

: (Rio)n 

w 

wherein, 

R9 is sel^ted from the group consisting of substimted or unsubstituted aryl, heteroaryl, 
arylalkyl, arylalkenyl, heteroarylalkyl, and heteroarylalkenyl; 
Rio is substimted or unsubstituted alkyl; 

n is an integer from 0-2; and. 
if Di is carbon than D2 is oxygen, D3 is absent, and D4 is selected from the group consisting 

of substituted or unsubstituted aryl, heteroaryl, carbocycyl, alkoxyaryl, fused arylaryl, fiised 

arylheteroaryl, and fused heteroarylaryl; or, 

if Di is nitrogen than D2 is nitrogen, D4 is absent, and D3 is selected from the group 
consisting of substituted or unsubstituted aryl, heteroaryl, carbocycyl, alkoxyaryl, fiised 
arylaryl; fused arylheteroaryl, and fused heteroarylaryl. 

Indoledione compounds as described throughout this application include compounds of 
formula (TV) as shown below: 



Ri3 
N 

O 



IV 
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Rii and R,2 are independently selected from the group consisting of H, nitro, halogen, 
amino, hydroxy, cyano, carboxcyclic acid, and substituted or unsubstituted alkyl, aryl, 
heteroaryl, alkoxy, alkylcarbonyl. alkylcarbonylamino, alkylaminocarbonyl, 
aminocarbonyl, arylalkoxy. heteroarylalkoxy, alkylamino. arylalkylamino. arylamino, 
heteroarylamino, heteroarylaminoalkyl, heterocyclyl, heterocyclylalkoxy, 
heterocyclylalkyl, and carbocyclyl groups; and, 

Ri3 is selected from the group consisting of substituted or unsubstituted aryl, heteroaryl, 
arylalkyl, heteroarylalkyl, heterocyclyl, heterocyclylalkyl, and alkylbenzyl. 

TetrahydroisoquinoUne (THIQ) compounds as described throughout this appHcation 
include compounds of formula (V) as shown below: 




Ri7 
V 

wherein, 

L is a covalent bond or selected from the group consisting of -CH2-, -CO-, -0-, -S-, CHF, 
-NH-, -NR20-, where R20 is lower alkyl; 

Ri4 is selected from the group consisting of hydrogen, halogen, and substituted or 
unsubstituted alkyl; 

Ri5 is selected from the group consisting of substituted or unsubstimted carbocyclyl, aryl, 
arylalkyl, alkoxyaryl, heteroaryl, heterocyclyl; 

R16 is selected from the group consisting of hydrogen, halogen, and substituted or 
, unsubstituted alkyl; 

R,7 is selected from the group consisting of hydrogen, halogen, and substituted or 
unsubstituted alkyl; and, 

R18 and Ri9 are independently selected from the group consisting of H. hydroxy, halogen, 
alkoxy. amino, unsubstituted alkyl, substituted alkyl, and alkylamino. 



Benzocyclodione compounds as described throughout this application include 
compounds of formula (VI) as shown below: 
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P 



O 



VI 



wherein. 



E is selected from the group consisting of NRas or CR26R27; 

R2i» R23. and R24 are independently selected from the group consisting of H, hydroxy, 

halogen, alkoxy, amino, unsubstituted alkyl, substituted alkyl, and alkylamino; 

R22 is selected from the group consisting or H, hydroxy, halogen, alkoxy, amino, and 

unsubstituted or substimted alkyl, and alkylamino, arylalkyl, heteroarylalkyl, aryl, 

heteroaryl, arylcarbonyl, heterocyclyl, heterocyclylalkyl, and heteroarylcarbonyl; 

R25 is selected from the group consisting of substituted or unsubstituted aryl, heteroaryl, 

heterocyclyl, carbocyclyl, arylalkyl, heteroarylalkyl, and heterocycly alkyl; 

R26 is selected from the group consisting of H, halogen, hydroxy, amino, and substituted 

or unsubstimted alkyl, carbonylalkyl, and alkylcarbonylalkyl; and; 

R27 is selected from the group aryl, arylalkyl, heteroarylalkyl, heterocyclyl, 

heterocyclylalkyl, carbocyclyl, arylcarbonylalkyl, and arylalkylcarbonyl. 

Aminoazavinyl compounds as described throughout this application include compounds 
of formula (VII) as shown below: 



G is either S or NH; 

R28 is selected from the group consisting of H, and substituted or unsubstimted alkyl, 
aryl, heteroaryl, heteroarylalkyl, arylalkyl, carbocyclyl, carbocyclylalkyl, heterocyclyl, 
and heterocyclylalkyl; 

Q is selected from the group consisting of hydrogen, substituted alkyl, unsubstituted alkyl, 
and aryl, substituted aryl, heteroaryl, substituted heteroaryl, heterocyclyl, substituted 




vn 



wherein. 
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heterocyclyl, fused or unfused arylaryl, substituted arylaryl, arylheteroaryl, substituted 
arylheteroaryl, heteroarylheteroaryl, and substituted heteroarylheteroaryl; 

Vi is selected from the group consisting of alkyl, substituted alkyl, aryl, substituted aryl, 
arylalkyl, substituted arylalkyl, heteroaryl, substituted heteroaryl, heteroarylalkyl, 
substituted heteroarylalkyl, alkoxy, substituted alkoxy, aminocarbonyl, alkoxycarbonyl, 
carboxyl sulfonyl, methanesulfonyl, and substituted or unsubstituted alkylcarbonyl, 
arylcarbonyl, aralkylcarbonyl, heteroarylcarbonyl, heteroaralkylcarbonyl, 
alkylcarbonyloxy, arylcarbonyloxy, aralkylcarbonyloxy, heteroarylcarbonyloxy, 
heteroaralkylcarbonyloxy, alkylaminocarbonyloxy, arylaminocarbonyloxy, fonnyl, 
loweralkylcarbonyl, loweralkoxycarbonyl, aminocarbonyl, aminoaryl, alkylsulfonyl, 
sulfonamide, aminoalkoxy, alkylamino, heteroaryl amino, alkylcarbonylamino, 
alkylaminocarbonylamino, arylaminocarbonylamino, aralkylcarbonylamino, 
heteroarylcarbonylamino, arylcarbonylamino, cycloamidino, cycloalkyl, cycloimido, 
arylsulfonyl and arylsulfonamido; and, 

V2 is selected from the group consisting of hydrodgen, halogen, alkyl, substituted alkyl, 
aryl, substituted aryl, arylalkyl, substituted arylalkyl, heteroaryl, substituted heteroaryl, 
heteroarylalkyl, substituted heteroarylalkyl, alkoxy, substituted alkoxy, aminocarbonyl, 
alkoxycarbonyl, carboxyl sulfonyl, methanesulfonyl, and substituted or unsubstituted 
alkylcarbonyl, arylcarbonyl, aralkylcarbonyl, heteroarylcarbonyl, heteroaralkylcarbonyl, 
alkylcarbonyloxy, arylcarbonyloxy, aralkylcarbonyloxy, heteroarylcarbonyloxy, 
heteroaralkylcarbonyloxy, alkylaminocarbonyloxy, arylaminocarbonyloxy, formyl, 
loweralkylcarbonyl, loweralkoxycarbonyl, aminocarbonyl, aminoaryl, alkylsulfonyl, 
sulfonamido, aminoalkoxy, alkylamino, heteroarylamino, alkylcarbonylamino, 
alkylaminocarbonylamino, arylaminocarbonylamino, aralkylcarbonylamino, 
heteroarylcarbonylamino, arylcarbonylamino, cycloamidino, cycloalkyl, cycloimido, 
arylsulfonyl and arylsulfonamido. 

Lactam compounds as described throughout this application include compounds of 
formula (Vni) as shown below: 
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vin _ - 

wherein, 

Wi is selected from the group consisting of -OH, -OR36 groups, -NR37R38; 

W2 is selected from the group consisting of O, S, and NR39 groups; 

R29 and R30 join to form a 5 to 6 membered substituted or unsubstituted ring comprising 

all carbon atoms or at least one O, N, or S atom; 

R35 and R39 may be the same or different and are selected from the group consisting of H, 
. -OH substituted and unsubstituted alkyl groups, substituted and unsubstituted aryl 
groups, -C(=0)H, -C(=0)-alkyl groups, and -C(=0)-aryl groups; 
R31, R32, R33, and R34 may be the same or different and are independently selected from 
the group consisting of H, CI, Br, F, I, -NO2, -CN, -OH, -OR40 groups, -NR41R42 groups, 
-C(=0)R43 groups, -SH groups, substituted and unsubstituted amidinyl groups, 
substituted and unsubstituted guanidinyl groups, substituted and unsubstituted alkyl 
groups, substituted and unsubstituted aryl groups, substituted and unsubstituted alkenyl 
groups, substituted and unsubstituted alkynyl groups, substimted and unsubstituted 
heterocyclyl groups, substimted and unsubstituted alkylaminoalkyl groups, substituted 
and unsubstituted dialkylaminoalkyl groups, substimted and unsubstituted arylaminoalkyl 
groups, substituted and unsubstituted diarylaminoalkyl groups, substituted and 
unsubstituted (alkyl)(aryl)aminoalkyl groups, substituted and unsubstituted 
heterocyclylalkyl groups, substituted and unsubstituted aminoalkyl groups, substituted 
and unsubstituted heterocyclylaminoalkyl groups, substituted and unsubstituted 
diheterocyclylaminoalkyl groups, substituted and unsubstituted 
(alkyl)(heterocyclyl)aminoalkyl groups, substituted and unsubstituted 
(aryl)(heterocyclyl)aminoalkyl groups, substimted and unsubstituted hydroxyalkyl 
groups, substituted and unsubstituted alkoxyalkyl groups, substituted and unsubstituted 
aryloxyalkyl groups, and substituted and unsubstituted heterocyclyloxyalkyl groups; 
R36 is selected from the group consisting of substituted and unsubstituted alkyl groups, 
substimted and unsubstituted aryl groups, substituted and unsubstituted heterocyclyl 
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groups, substituted and unsubstituted heterocyclylalkyl groups, -C(=0)H, -C(=0)-alkyl 
groups. -C(=0)-aryl groups, -C(=0)0-alkyl groups, -C(=0)0-aryl groups, -C(=0)NH2, - 
C(=0)NH(alkyl) groups, -C(=0)NH(aryl) groups, -C(=0)N(alkyl)2 groups, - 
C(=0)N(aryl)2 groups, -C(=0)N(alkyl)(aryl) groups, -NHz, -NH(alkyl) groups, -NH(aryl) 
groups, -N(alkyl)2 groups, -N(alkyl)(aryl) groups, -N(aryl)2 
groups, -C(=0)NH(heterocyclyl) groups, -C(=0)N(heterocyclyl)2 
groups, -C(=0)N(alkyl)(heterocyclyl) groups, and -C(=0)N(aryl)(heterocyclyl) groups; 
R37 is selected from the group consisting of H, substituted and unsubstituted alkyl groups, 
substituted and unsubstituted aryl groups, and substituted and unsubstituted heterocyclyl 
groups; 

R38 is selected from the group consisting of H, substituted and unsubstituted alkyl groups, 
substituted and unsubstituted aryl groups, substituted and unsubstituted heterocyclyl 
groups, -OH, alkoxy groups, aryloxy groups, -NH2, substituted and unsubstituted 
heterocyclylalkyl groups, substituted and unsubstituted aminoalkyl groups, substituted 
and unsubstituted alkylaminoalkyl groups, substituted and unsubstituted 
dialkylaminoalkyl groups, substituted and unsubstituted arylaminoalkyl groups, 
substituted and unsubstituted diarylaminoalkyl groups, substituted and unsubstituted. 
(alkyl)(aryl)aminoalkyl groups, substituted and unsubstituted alkylamino groups, 
substituted and unsubstituted arylamino groups, substituted and unsubstituted 
dialkylamino groups, substituted and unsubstituted diarylamino groups, substituted and 
unsubstitiited (alkyl)(aryl)amino groups, -C(=0)H, -C(=0)-alkyl groups, -C(=0)-aryl 
groups, -C(=0)0-alkyl groups, -C(=0)0-aryl groups, -C(=0)NH2, -C(=0)NH(alkyl) 
groups, -C(=0)NH(aryl) groups, -C(=0)N(alkyl)2 groups, -C(=0)N(aryl)2 
groups, -C(=0)N(alkyl)(aryl) groups, -C(=0)-heterocyclyl groups, -C(=0)-0- 
heterocyclyl groups, -C(=0)NH(heterocyclyl) groups. -C(=0)-N(heterocyclyl)2 groups, - 
C(=0)-N(alkyl)(heterocyclyl) groups. -C(=0)-N(aryl)(heterocyclyl) groups, substituted - 
and unsubstituted heterocyclylaminoalkyl groups, substituted and unsubstituted 
diheterocyclylaminoalkyl groups, substituted and unsubstituted 
(alkyl)(heterocyclyl)aminoalkyl groups, substituted and unsubstituted 
(aryI)(heterocyclyl)aminoalkyl groups, substitiited and unsubstituted hydroxyalkyl 
groups, substituted and unsubstituted alkoxyalkyl groups, substituted and unsubstituted 
aryloxyalkyl groups, and substituted and unsubstituted heterocyclyloxyalkyl groups; 
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R41 is selected from the group consisting of H, substituted and unsubstituted alkyl groups, 
substituted and unsubstituted aryl groups, and substituted and unsubstituted heterocyclyl 
groups; 

R42 is selected from the group consisting of H, substituted and unsubstituted alkyl groups, 
substituted and unsubstituted aryl groups, substituted and unsubstituted heterocyclyl 
groups, -C(=0)H, -C(=0)-alkyl groups, -C(=0)-aryl groups, - 

C(=0)NH2, -C(=0)NH(alkyI) groups, -C(=0)NH(aryl) groups, -C(=0)N(alkyl)2 groups, 
-C(=0)N(aryl)2 groups, -C(=0)N(alkyl)(aryl) groups, -C(=0)0-alkyI 
groups, -C(=0)0-aryl groups, substituted and unsubstituted aminoalkyl groups, 
substituted and unsubstituted alkylaminoalkyl groups, substituted and unsubstituted 
dialkylaminoalkyl groups, substituted and unsubstituted arylaminoalkyl groups, 
substituted and unsubstituted diarylaminoalkyl groups, substituted and unsubstituted 
(alkyl)(aryl)aminoalkyl groups, substituted and unsubstituted heterocyclylalkyl groups, - 
C(=0)-heterocyclyl groups, -C(=0)-0-heterocyclyl groups, -C(=0)NH(heterocyclyl) 
groups, -C(=0)-N(heterocyclyl)2 groups, -C(=0)-N(alkyl)(het6rocyclyl) groups; -C(=0)- 
N(aryl)(heterocyclyl) groups, substituted and unsubstituted heterocyclylaminoalkyl 
groups, substituted and unsubstituted diheterocyclylaminoalkyl groups, substituted and 
unsubstituted (heterocyclyl)(alkyl)aminoalkyl groups, substituted iand unsubstituted 
(heterocyclyl)(aryl)aminoalkyl groups, substituted and unsubstituted hydroxyalkyl 
. groups, substituted and unsubstituted alkoxyalkyl groups, substituted and unsubstituted 
. aryloxyalkyl groups, and substituted and unsubstituted heterocyclyloxyalkyl groups; and . 
R43 is selected from the group consisting of H, -NH2, -NH(alkyl) groups, -NH(aryl) 
groups, -N(alkyl)2 groups, -N(aryl)2 groups, -N(alkyl)(aryl) groups, -NH(heterocyclyl) 
groups, -N(heterocyclyl)(alkyl) groups, -N(heterocyclyl)(aryl) groups, -N(heterocyclyl)2 
groups, substituted and unsubstituted alkyl groups, substituted and unsubstituted aryl 
groups, -OH, substituted and unsubstituted alkoxy groups, substituted and unsubstituted 
heterocyclyl groups, substituted and unsubstituted aryloxy groups, heterocyclyloxy 
groups, -NHOH, -N(alkyI)OH groups, -N(aryl)OH groups, -N(alkyl)0-alkyl groups, - 
N(aryl)0-alkyl groups, -N(aIkyl)0-aryl groups, and -N(aryl)0-aryl groups. 

Preferably R29 and R30 join together to form a substituted or unsubstituted phenyl ring. 
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Hydropthalamide compounds as described throughout this application include 
compounds of fomiula (IX) as shown below: 

^46^45 O 
"47 



R4I 




IX - 

wherein, 

R44 is selected from the group consisting of substituted or unsubstituted aryl, heteroaryl, 
arylalkyl, heteroarylalkyl, fused arylaryl, unfiised arylaryl, fused heteroarylaryl, unfused 
heteroarylaryl, fused arylheteroaryl, and unfused arylheteroaryl; 
: R45, R47,,R49, and R51 may be the same or different and are independendy selected ftom 
. the group consisting of H, nitro, halogen, amino, hydroxy, cyano, carboxcyclic acid, and 
substituted or unsubstituted alkyl, aryl, heteroaryl, alkoxy, alkylcarbonyl, 
alkylcarbonylamino, alkylaminocarbonyl, aminocarbonyl, arylalkoxy, heteroarylalkoxy, 
alkylamino, arylalkylamino, arylamino, heteroarylamino, heteroarylaminoalkyl, 
heterpcyclyl, heterocyclylalkoxy, heterocyclylalkyl, and carbocyclyl; and 
R46, R48.. R50. and R52 may be the same or different and are independently selected from 
the group consisting of H, halogen, and substituted or unsubstituted alkyl groups. 



Benzophenone compounds as described throughout this application include compounds 
of fqimula (X) as shown below: 

O 




X 

wherein, 

R53 is independently selected from the group consisting of H, nitro, halogen, amino, 
hydroxy, cyano, carboxcyclic acid, and substituted or unsubstituted alkyl, aryl, 
heteroaryl, alkoxy, alkylcarbonyl, alkylcarbonylamino, alkylaminocarbonyl, 
aminocarbonyl, arylalkoxy, heteroarylalkoxy, alkylamino, arylalkylamino, arylamino, 
heteroarylamino, heteroarylaminoalkyl, heterocyclyl, heterocyclylalkoxy, 
heterocyclylalkyl, and carbocyclyl; 
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R54 is independently selected ficom the group consisting of H, iiitro, halogen, amino, 
hydroxy, cyano, carboxcyclic acid, and substituted or unsubstituted alkyl, aryl, 
heteroaryl, alkoxy, alkylcarbonyl, alkylcarbonylamino, aUcylaminocarbonyl, 
aminocarbonyl, arylalkoxy, heteroarylalkoxy, alkylamino, arylalkylamino, arylamino, 
heteroarylamino, heteroarylaminoalkyl, heterocyclyl, heterocyclylalkoxy, 
heterocyclylalkyl, and carbocyclyl; and 

o and p are integers from 0-4. _ - 

Isoxazole compounds as described throughout this application include compounds of 
formula (XI) as shown below: 




XI 

wherein, 

R55 is selected from the group consisting of substituted or unsubstituted aryl, arylalkyl, 

heteroaryl, heteroarylalkyl, heterocyclyl, and heterocyclylalkyl; 

R56 is selected from the group consisting of substituted or unsubstituted aryl, arylalkyl, 

heteroaryl, heteroarylalkyl, heterocyclyl, and heterocyclylalkyl; and, 

R57 is selected from the group consisting of H, halogen, hydoxy, and substituted or 

unsubstituted alkyl, aryl, heteroaryl. heterocyclyl, and carbonyl. 

Sterol compounds as described throughout this application include compounds of 
formula (XIT) as shown below: 




wherein, 

R58 is selected from the group consisting of substituted or unsubstituted aryl, arylalkyl, 
heteroaryl, heteroarylalkyl, heterocyclyl, and heterocyclylalkyl. 
Preferably Rsg is a pyranone substituent. 
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Quinazilinone compounds as described throughout this application include compounds of 
formula (Xm) as shown below: 

Rso Rf51 




N 

O R64 

xm - ■ 

wherein, 

R59 is selected from the group consisting of H, halogen, hydroxy, and substituted or 
unsubstituted alkyl, aminoalkyl, alldyaminoalkyl, alkoxy, dialkylaminoalkyl, 
hydroxyalkyl, alkenyl, alkynyl, ciarbocyclyl, carbocyclylalkyl, heterocyclyl, and 
heterocyclylalkyl; 

Reo is selected from the group consisting of substituted or unsubstituted aryl. heteroaryl, 
arylalkyl, heteroarylalkyl, and heterocyclylalkyl; and, 

Rei, R62. R63, and R64 may be the same or different and are independently selected from 
the group consisting of H, nitro, halogen, amino, hydroxy, cyano, caiboxcycKc acid, and 
substituted or unsubstituted alkyl, aryl, heteroaryl, alkoxy, alkylcarbonyl, 
alkylcarbonylamino, alkylaminocarbonyl, aminocarbonyl, ^lalkoxy, heteroarylalkoxy, 
alkylamino, arylalkylamino, arylamino. heteroarylamino, heteroarylaminoalkyl, 
heterocyclyl, heterocyclylalkoxy, heterocyclylalkyl, and carbocyclyl groups. 

Pyrrole compounds as described throughout this apphcation include compounds of 
formula (XIV) as shown below: 

^68 .^67 

"69 ^n"^ Rbb 
I 

Res 
XIV 

wherein. 

Res is selected from the group consisting of H, hydroxy, and substituted or unsubstituted 
alkyl. aryl. heteroaryl. heteroarylalkyl, arylalkyl, heteroarylaminoalkyl, arylaminoalkyl, 
heteroaryloxyalkyl, and aryloxyalkyl groups; 

Ree, R67, Rgg, and may be the same or different and are independently selected from 
the group consisting of H, nitro, halogen, amino, hydroxy, cyano, carboxcyclic acid, and 
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substituted or unsubstituted alkyl, aryl, heteioaryl, alkoxy, alkylcarbonyl, 
alkylcarbonylainino, alkylaminocarbonyl, aminocarbonyl, arylalkoxy, heteroarylalkoxy, 
alkylamino, arylaikylamino, arylamino, heteroarylamino, heteroarylaminoalkyl, 
heterocyclyl, heterocyclylalkoxy, heterocyclylalkyl, and carbocyclyl groups. 

5 

Further preferred pyrrole compounds include those shown in Formula (XV): 




10 CXV) 

wherein: 

Ki is nitrogen, oxygen, or optionally substituted carbon; 

W is absent or is selected from the group consisting of -0-, -S-, -S(0)-, -SO2-, -NH-, -NH- 
.5 CO-, -NR'CO-, -NHSO2-, -NR'S02-, -CO-, -COj-, -CHj--, -CF2~, CHF, -CONH-, -CONR'-, 
and -NR'-, where R* is alkyl, substituted alkyl, cycloalkyl, aryl, heteroaryl, heterocyclo; 
At is optionally substituted aryl, heteroaryl, or a protecting group; 

R70 and R70' are independently selected from the group consisting of hydrogen and methyl; 
R71. R72» R73. and R74 are independently selected from the group consisting of hydrogen, 
0 hydroxyl, and optionally substituted loweralkyl, cycloloweralkyl, cyclicaminoalkyl, alkyl- 
aminoalkyl, loweralkoxy, amino, alkylamino, alkylcarbonyl. arylcarbonyl, aralkylcarbonyl, 
heteroarylcarbonyl, heteroaralkylcarbonyl, aryl and heteroaryl; 

R75 and R78 are independently selected from the group consisting of hydrogen, halo, and 
optionally substituted loweralkyl, cycloalkyl, alkoxy, amino, aminoalkoxy, carbonyloxy, 
5 aminocarbonyloxy, alkylcarbonylamino, arylcarbonylamino, aralkylcarbonylamino, hetero- 
arylcarbonylamino, heteroaralkylcarbonylamino, cycloimido, heterocycloimido, amidino, 
cycloamidino, heterocycloamidino, guanidinyl, aryl, heteroaryl, heterocycloalkyl, 
heterocyclocarbonyloxy, heteroarylcarbonyloxy, and arylsulfonamido; 

R76 is selected from the group consisting of hydrogen, aryl, heteroaryl, substituted heteroaryl, 
0 heterocyclyl, and substituted heterocyclyl; 
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R77 is selected &om the group consisting of hydrogen, hydroxy, halo, carboxyl, nitro, amino, 
amido, amidino. imido, cyano, sulfonyl, methanesulonyl, and substituted or unsubstituted alkyl. 
alkoxy. alkylcarbonyl, arylcarbonyl, aralkylcarbonyl. heteroarylcarbonyl, heteroaralkylcarbonyl. 
alkylcarbonyloxy, arylcarbonyloxy, aralkylcarbonyloxy, heteroarylcarbonyloxy, 
heteroaralkylcarbonyloxy, alkylaminocarbonyloxy, arylaminocarbonyloxy, fonnyl, 
loweralkylcarbonyl, loweralkoxycarbonyl, aminocarbonyl, aminoaryl, alkylsulfonyl, 
sulfonamido,aminoalkoxy,alkylamino,heteroarylamino,alkylcarbonylamino, - ' 

alkylaminocarbonylamino, arylaminocarbonylamino, aralkylcarbonylamino, 
heteroarylcarbonylamino, arylcarbonylamino, heteroarylcarbonylamino cycloamido, 
cyclothioamido, cycloamidino, heterocycloamidino. cycloalkyl, cycloimido, heterocycloimido, 
guanidinyl. aryl, heteroaryl. heteiocyclo. heterocycloalkyl. arylsulfonyl and arylsulfonamido; ' 

Anthraquinone compounds of the instant invention include, for example, compounds of 
Fonnula (XVI): 

R79 o 

R82 O 
XVI 

wherein, 

R79, Rso, Rsi, and Rgz may be the same or different and are independently selected from 
the group consisting of H, nitro, halogen, amino, hydroxy, cyano, carboxcyclic acid, and 
substituted or unsubstituted alkyl, aryl, heteroaryl, alkoxy, alkylcarbonyl, 
alkylcarbonylamino, sulfonyl. aminosulfonyl, alkylaminocarbonyl, aminocarbonyl, 
arylalkoxy, heteroarylalkoxy, alkylamino. arylalkylamino, arylamino, heteroarylamino. 
heteroarylaminoalkyl, heterocyclyl, heterocyclylalkoxy, heterocyclylalkyl, and 
carbocyclyl groups; and. 

R83 and R84 are taken together to form a substituted or unsubstituted 5-6 membered ring 
containing all carbon atoms or 1-2 heteroatoms selected from the group consisting of O, 
S.andN. 
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Quinoxaline compounds referred to throughout this application include tricyclic, partially 
unconjugated compounds optionally substituted with nitrogen heteroatoms as shown in the 
preferred quinoxaline embodiment (XVII) below: 




wherein, 

Ji is either C or N, 

Ji* is selected from the group consisting of H, substituted aryl, unsubstituted aryl, 
substituted heteroaryl, and. unsubstituted heteroaryl; 
J2 is either C or N, 

J2' is selected from the group consisting of H, substituted aryl, unsubstituted aryl, 

substituted heteroaryl, and unsubstituted heteroaryl; 

J3 is selected from the group consisting of -CO-, -NH-, and -N=; 

if J4 is -O- then J4' is absent; or, 

if J4 is =C- then J4' is selected ftom the group consisting of H and substituted or 
unsubstituted aUcyl, alkbxy, aryl, heteroaryl, heteroarylalkyl, arylalkyl, aminoaUcyl, 
alkylamino, and alkylthio groups; and, 

K.85> R86> R87> I^8> and Rgg may be the same or different and are independently selected 
from the group consisting of H, nitro, halogen, amino, hydroxy, cyano, carboxcyclic acid, 
and substituted or unsubstituted alkyl, aryl, heteroaryl, alkoxy, aUcylcarbonyl, 
alkylcarbonylamino, sulfonyl, aminosulfonyl, alkylaminocarbonyl, aminocarbonyl, 
arylalkoxy, heteroarylalkoxy, alkylamino, arylalkylamino, arylamino, heteroarylamino, 
heteroarylaminoalkyl, heterocyclyl, heterocyclylalkoxy, heterocyclylalkyl, and 
carbocyclyl groups. 

Triazine compounds refer to substituted 6-membered heterocyclic groups with 3 nitrogen 
atoms distributed throughout the ring. The preferred embodiments of the instant invention 
include those shown in stmctures (XVHI), (XIX) and (XX) shown below: 



-250- 



wo 2004/092360 



PCT/US2004/011710 



Q ^93 



xvm 

wherein, 

R90 is selected from the group consisting of substituted or unsubstituted alkyl, alkenyl, 
akynyl, aryl, heteroaryl, heteroarylalkyl, heteroarylalkenyl, arylalkyl, and arylalkenyl; 
R91 and R93 are independently selected from the group consisting of H, and unsubstituted 
alkyl; 

R91 is aryl; preferably phenyl, 

R94\^N^R97 
T H R96 

R95 

xrx 

wherein, 

R94 is selected from the group consisting of H, amino, alkyl, aminoalkyl, and halogen; 
R95 is selected from the group consisting of substituted or unsubstituted aryl, arylamino, 
arylalkylamino, heteroaryl, heteroarylamino, and heteroalkylamino; ' 
R96 and R97 are independently selected from the group consisting of H, halogen, and 
alkyl, preferably methyl; or, 

R96 may form a double bond with the nitrogen atom directly below it as indicated by the 
dashed line in the above structure; and. 



R98>.,^A^N^R. 



100 

N 



R99 

XX 

wherein. 
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R98 is selected firom the group consisting of H, substituted alkyl, and unsubstituted alkyl; 
preferably methyl, 

R99 is selected from the group consisting of H, substituted alkyl, and unsubstituted alkyl; 
preferably ethyl, 

5 Rioo is selected from the group consisting of substituted or unsubstituted aiyl, heteroaryl, 

alkoxyaryl, arylalkyl, and heteroarylalkyl. 



Benzazole compounds as described throughout this application include compounds of 
10 formula (XXI) as shown below: 




XXI 

wherein, 

A is selected from the group consisting of -0-, -S-, -NH-, and -NRg-; 

15 W is selected from the group consisting of -CH2-, -0-, -S-, -NH-r, and -NRg-; 

R7 is selected from the group consisting of carbocyclyl, unfused carbocyclylcarbocyclyl, 
substituted aryl, unsubstituted aryl, substituted hetCToaryl, unsubstituted heteroaryl, 
substituted fused arylheteroaryl, unsubstituted fused arylheteroaryl, substituted unfused 
arylaryl and unsubstituted unfused arylaryl; 

10 Re is selected from the group consisting of substituted or unsubstituted aryl, and 

heteroaryl; and, 

Rg is independently substimted or unsubstituted alkyl. 

Pyrazalopyrimidine compounds as described throughout this application include 
15 compounds of formula (XXII) as shown below: 



Rl05 




xxn 
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wherein, 

Rioi is selected from the group consisting of H, nitro, halogen, amino, hydroxy, cyano, 
carboxcyclic acid, and substituted or unsubstituted alkyl, aryl, heteroaryl, alkoxy, 
alkylcarbonyl, alkylcarbonylamino, sulfonyl, aminosulfonyl, alkylammocarbonyl, 
aminocarbonyl, arylalkoxy, heteroarylalkoxy, alkylamino, arylalkylamino, arylamino, 
heteroarylamino, heteroarylaminoalkyl, heterocyclyl, heterocyclylalkoxy, 
heterocyclylalkyl, and carbocyclyl groups; _ - 

Ri02 is selected from the group consisting of H, nitro, halogen, amino, hydroxy, cyano, 
carboxcyclic acid, and substituted or unsubstituted alkyl, aryl, heteroaryl, alkoxy, 
alkylcarbonyl, alkylcarbonylamino, alkylaminocarbonyl, aminocarbonyl, arylalkoxy, 
heteroarylalkoxy, alkylamino, arylalkylamino, arylamino, heteroarylamino, 
heteroarylaminoalkyl, heterocyclyl, heterocyclylalkoxy, heterocyclylalkyl, and 
carbocyclyl groups; 

Ri03 is selected from the group consisting of H, nitro, halogen, amino, hydroxy, cyano, 
carboxcyclic acid, trifluoromethyl, and substituted or unsubstituted alkyl, aryl, heteroaryl, 
alkoxy, alkylcarbonyl, alkylcarbonylamino, alkylaminocarbonyl, aminocarbonyl, 
arylalkoxy, heteroarylalkoxy, alkylamino, arylalkylamino, arylamino, heteroarylamino, 
heteroarylaminoalkyl, heterocyclyl, heterocyclylalkoxy, heterocyclylalkyl, and 
carbocyclyl groups; 

Ri04 is selected from the group consisting of H and substituted or unsubstituted aryl, 
heteroaryl, arylalkoxy, heteroarylalkoxy, arylalkylamino, arylamino, heteroarylamino, 
heteroarylaminoalkyl, heterocyclyl, heterocyclylalkoxy, heterocyclylalkyl, 
carbocyclylalkyl and carbocyclyl groups; 

Rio5 is selected from the group consisting of H and substituted or unsubstituted aryl, 
heteroaryl, arylalkoxy, heteroarylalkoxy, arylalkylamino, arylamino, heteroarylamino, 
heteroarylaminoalkyl, heterocyclyl, heterocyclylalkoxy, heterocyclylalkyl, 
carbocyclylalkyl and carbocyclyl groups; 
wherein at least one of R104 and R105 is not H. 

SMIP compounds identified by in-vitro (cellular or non-ceUular assays) or in-vivo 
methods are thoroughly described in Methods 1 and 2 below. 
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Phannaceutical compositions containing the compounds of the invention may be in any 
form suitable for the intended method of administration, including, for example, a solution, a 
suspension, or an emulsion. Liquid carriers are typically used in preparing solutions, 
suspensions, and emulsions. Liquid carriers contemplated for use in the practice of the present 
invention include, for example, water, saline, phannaceutically acceptable organic solvent(s), 
phaimaceutically acceptable oils or fats, and the like, as well as mixtures of two or more thereof. 
The liquid carrier may contain other suitable phannaceutically acceptable additives such^as 
solubilizers, emulsifiers, nutrients, buffers, preservatives, suspending agents, thickening agents, 
viscosity regulators, stabilizers, and the like. Suitable organic solvents include, for example, 
monohydric alcohols, such as ethanol, and polyhydric alcohols, such as glycols. Suitable oils 
include, for example, soybean oil, coconut oil, olive oil, safDower oil, cottonseed oil, and the 
like. For parenteral administration, the carrier can also be an oily fester such as ethyl oleate, 
isopropyl myristate, and the like. Compositions of the present invention may also be in the form • 
of microparticles, microcapsules, liposomal encapsulates, and the like, as well as combinations 
of any two or more thereof. 

Other additives include inmiunostimulatory agents known in the art. Immunostimulatory 
oligonucleotides and polynucleotides are described in PCX WO 98/55495 and PCT WO 
98/16247. U.S. Patent Application No. 2002/0164341 describes adjuvants including an 
unmethylated CpG dinucleotide (CpG ODN) and a non-nucleic acid adjuvant. U.S. Patent 
Application No. 2002/0197269 describes compositions comprising an antigen, an antigenic 
CpG-ODN and a polycationic polymer. Other immunostimulatory additives described in the art • 
may be used, for example, as described in U.S. Patent No. 5,026,546; U.S. Patent No. 4,806,352;. 
and U.S. Patent No. 5,026,543. 

A controlled release delivery system may be used, such as a diffusion controlled matrix •. 
system or an erodible system, as described for example in: Lee, "Diffusion-Controlled Matrix 
Systems", pp. 155-198 and Ron and Langer, "Erodible Systems", pp. 199-224, in 'Treatise on 
Controlled Drug Delivery", A. Kydonieus Ed., Marcel Dekker, Inc., New York 1992. The 
matrix may be, for example, a biodegradable material that can degrade spontaneously in situ and 
in vivo for, example, by hydrolysis or enzymatic cleavage, e.g., by proteases. The delivery 
system may be, for example, a naturally occurring or synthetic polymer or copolymer, for 
example in the form of a hydrogel. Exemplary polymers with cleavable linkages include 
polyesters, polyortfioesters, polyanhydrides, polysaccharides, poly(phosphoesters), polyamides, 
polyurethanes, poly(imidocarbonates) and polyCphosphazenes). 
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The compounds of the invention may be administered enterally, orally, parenterally, 
sublingually, by inhalation spray, rectally, or topically in dosage unit formulations containing 
conventional nontoxic pharmaceutically acceptable carriers, adjuvants, and vehicles as desired. 
For example, suitable modes of administration include oral, subcutaneous, transdermal, 
5 transmucosal, iontophoretic, intravenous, intramuscular, intraperitoneal, intranasal, subdennal, 
rectal, and the like. Topical administration may also involve the use of transdermal 
administration such as transdermal patches or ionophoresis devices. The term parenteral 
includes subcutaneous injections, intravenous, intramuscular, intrastemal injection, or infusion 
techniques. 

10 . Injectable preparations, for example, straile injectable aqueous or oleaginous suspensions 

may be formulated according to the known art using suitable dispersing or wetting agents and 
suspending agents. The sterile injectable preparation may also be a sterile injectable solution or 
suspension in a nontoxic parenterally acceptable diluent or solvent, for example, as a solution in 
1,3-propanediol. Among the acceptable vehicles and solvents that may be employed are water, 

15 Ringer's solution, and isotonic sodium chloride solution. In addition, sterile, fixed oils are 

conventionally employed as a solvent or suspending medium. For this purpose any bland fixed 
oil may be employed including synthetic mono- or diglycerides. In addition, fatty acids such as 
oleic acid find use in the preparation of injectables. 

Suppositories for rectal administration of the drug can be prepared by mixing the drug 

20 with a suitable nonirritating excipient such as cocoa butter and polyethylene glycols that are solid 
at ordinary temperatures but liquid at the rectal temperature and will therefore melt in the rectum 
and release the drug. 

Solid dosage forms for oral administration may include capsules, tablets, pills, powders, • 
and granules. In such solid dosage forms, the active compound may be admixed with at least 

25 one inert diluent such as sucrose lactose or starch. Such dosage forms may also comprise, as is 
normal practice, additional substances other than inert diluents, e.g., lubricating agents such as 
magnesium stearate. In the case of capsules, tablets, and pills, the dosage forms may also 
comprise buffering agents. Tablets and pills can additionally be prepared with enteric coatings. 
Liquid dosage forms for oral administration may include pharmaceutically acceptable 

30 emulsions, solutions, suspensions, syrups, and elixirs containing inert diluents commonly used in 
the art, such as water. Such compositions may also comprise adjuvants, such as wetting agents, 
emulsifying and suspending agents, cyclodextrins, and sweetening, flavoring, and perfuming 
agents. 
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As to the mode of administration, it should be emphasized that it is the combination of 
therapeutic agents that gives rise to its synergistic therapeutic effect no matter whether the first 
and the second agent are administered together or separately. Therefore, the two agents may be 
given together in a single dose or in separate ones with respect to space and time. 

Effective amounts of the compounds of the invention generally include any amount 
sufficient to detectably treat viral infections. 

Successful trea^ent of a subject in accordance with the invention may result in the 
inducement of a reduction or alleviation of symptoms in a subject afflicted with a medical or 
biological disorder to, for example, halt the further progression of the disorder, or the prevention 
of the disorder. 

The amount of active ingredient that may be combined with the carrier materials to 
produce a single dosage form will vary depending upon the host treated aiid the particular mode 
of administration. It will be understood, however, that the specific dose level for any particular 
patient will depend upon a variety of factors including the activity of the specific compound 
employed, the age, body weight, general health, sex, diet, time of administration, route of 
administration, rate of excretion, drug combination, and the severity of the particular disease 
undergoing therapy. The therapeutically effective amount for a given situation can be readily 
determined by routine experimentation and is within the skill and judgment of the ordinary 
clinician. 

The compounds of the present invention can also be administered in the form of 
liposomes. As is known in the art^ liposomes are generally derived from phospholipids or other 
lipid substances. Liposomes are formed by mono- or multilamellar hydrated liquid crystals that 
are dispersed in an aqueous medium. Any non-toxic, physiologically acceptable and 
metabolizable lipid capable of forming liposomes can be used. The present compositions in 
liposome form can contain, in addition to a compound of the present invention, stabilizers, 
preservatives, excipients, and the like. The preferred lipids are the phospholipids and 
phosphatidyl cholines (lecithins), both natural and synthetic. Methods to form liposomes are 
known in the art. See, for example, Prescott, Ed., Methods in Cell Biology, Volume XIV, 
Academic Press, New York, N.W., p. 33 et seq (1976). 

While the SMIP compounds of the invention can be administered as the sole active 
pharmaceutical agent, they can also be used in combination with one or more other agents used 
in the treatment of SARSs. Other representative agents useful in combination with the 
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compounds of the invention for the treatment of viral infections include, for example, interferon, 
ribavirin, gancyclovir and the like. 

When additional active agents are used in combination with the compounds of the present 
invention, the additional active agents may generally be employed in therapeutic amounts as 
indicated in the Physicians' Desk Reference (PDR) 53"* Edition (1999), that is incorporated 
herein by reference, or such therapeutically useful amounts as would be known to one of 
ordinary skill in the art - - 

The compounds of the invention and the other therapeutically active agents can be 
administered at the recommended maximum clinical dosage or at lower doses. Dosage levels of 
the active compounds in the compositions of the invention may be varied so as to obtain a 
desired therapeutic response depending on the route of administration, severity of the disease and 
the response of the patient. The combination can be administered as separate compositions or as 
a single dosage form containing both agents. When administered as a combination, the 
therapeutic agents can be formulated as separate compositions that are given at the same time or 
different times, or the therapeutic agents can be given as a single composition. 

Compounds of the present invention can be readily synthesized using the methods 
described herein, or other methods, that are well known in the art. . 

The compounds can be used in the form of salts derived from inorganic or organic acids. 
These salts include but are not limited to the following: acetate, adipate, alginate, citrate, 
aspartate, benzoate, benzenesulfonate, bisulfate, butyrate, camphorate, camphorsulfonate, 
digluconate, cyclopentanepropionate, dodecylsulfate, ethanesulfonate, glucoheptanoate, 
glycerophosphate, hemisulfate, heptanoate, hexanoate, fiimarate, hydrochloride, hydrobromide, 
hydroiodide, 2-hydroxyethanesulfonate, lactate, maleate, methanesulfonate, nicotinate, 
2-napthalenesulfonate, oxalate, pamoate, pectinate, persulfate, 3-phenylproionate, picrate, 
pivalate, propionate, succinate, tartrate, thiocyanate, p-toluenesulfonate and undecanoate. Also, 
the basic nitrogen-containing groups can be quatemized with such agents as loweralkyl halides, 
such as methyl, ethyl, propyl, and butyl chloride, bromides, and iodides; dialkyl sulfates like 
dimethyl, diethyl, dibutyl, and diamyl sulfates, long chain halides such as decyl, lauryl, myristyl 
and stearyl chlorides, bromides and iodides, aralkyl halides like benzyl and phenethyl bromides, 
and others. Water or oil-soluble or dispersible products are thereby obtained. 

Examples of acids that may be employed to form pharmaceutically acceptable acid 
addition salts include such inorganic acids as hydrochloric acid, sulphuric acid and phosphoric 
acid and such organic acids as oxalic acid, maleic acid, succinic acid and citric acid. Basic 
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addition salts can be prepared in situ during the final isolation and purification of the compounds 
of formula (I), or separately by reacting carboxylic acid moieties with a suitable base such as the 
hydroxide, carbonate or bicarbonate of a pharmaceutical acceptable metal cation or with 
ammonia, or an organic primary, secondary or tertiary amine. Pharmaceutical acceptable salts 
include, but are not limited to, cations based on the alkali and alkaline earth metals, such as 
sodium, lithium, potassium, calcium, magnesium, aluminum salts and the like, as well as 
nontoxic anmionium, quaternary ammonium, and amine cations, including, but not limited to' 
ammonium, tetramethylammonium, tetraethylammoniuin, methylamine, dimethylamine, 
trimethylamine, triethylamine, ethylamine, and the like. Other representative organic amines 
useful for the formation of base addition salts include diethylamine, ethylenediamine, 
ethanolamine, diethanolamine, piperazine and the like. 

Various compounds and methods of their synthesis are disclosed in intemational patent 
application Publication Nos. WO02/1 8327 (benzamide and pyridylamide based compounds); 
WO0222598, and WO02/18383 (ABIQ based compounds); and WO 02/81443 (pthalamide base 
compounds), that have been found within context of this invention to be useful for immune 
potentiation. The entire disclosure of these U.S. and intemational publications is incorporated 
herein by this reference. Other compounds or intermediates of interest in the present invention 
were purchased from commercially available sources using the following method: the chemical 
structure of interest was drawn into the ACD-SC database (from MDL Information Systems). A 
search of the following companies/institutions, among others, retrieved the identified 
compound's supplier and purchasing information: ASDI, ASINEX, BIONET, CHEMBRIDGE, 
CHEMDIV, CHEMEX, CHEMSTAR, COMGENEX, CSC, BSTTERBIOSCREEN, LABOTEST, 
MAYBRIDGE, MICROSOURCE/GENESIS, OLIVIA, ORION, PEAKDALE, RYAN 
SCIENTIFIC, SPECS, TIMTEC, U OF FLORIDA, and ZELINSKY. 

BENZAZOLE COMPOtJNDS 
Scheme 1 

Compounds of the invention containing a benzimidazole core may be prepared using a 
number of methods familiar to one of skill in the art. In one method, suitably functionalized 
diamines may be coupled with various thioisocyanates to form the intermediate thioureas. 
Cyclization to form the benzimidazole moiety may be effected under known conditions such as 
with treatment carbodiimides or alkyl halides. Alternatively the diamines may be reacted 
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sequentially with carbonyl diimidazole and phosphoryl chloride followed by coupling with the 
appropriate amine. 

ArNCS fT^^'^IS 



'N^^NHAr » 
H 



• (lmid)2CO ...-^^N 
I 2- POCI3 (I y N>-_ci '^'■'^"2 



Compounds containing the oxazole structure may similarly be prepared according to the 
methods above or according to other known general procedures. Haviv et. al. (J. Med. Chem. 
1988, 31, 1719) describes a procedure for assembling oxazole cores wherein a hydroxy aniline is 
treated with ethyl potassium xanthate. The resulting sulfuryl benzoxazole may then be 
chlorinated and coupled with an amine. 

pc: ccysH o::>— ' 

R R ^ 

Compounds containing a benzothiazole core may also be prepared according to known 

methods. An brtho-halothioisocyanate may be reacted with an amine to form a thiourea. 

Reduction with NaH then allows formation of the thiazole ring. 

per pc:xi p::v- ^ 

Benzothiazoles may generally be substituted in accordance with the present invention, 
such as through the following synthetic pathway: 



.NYY°^ Ar-""' h^^^-TT-V"-" 48%HBr 
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Synthesis of 4-[(2-{ [4-chloro-3-(trifIuoromethyl)phenyl]amino}- 
IH-benzi midazol-6-vl')oxv1 -N-methvlDvridine-2-carboxamide 
The compound 4-[(2-{ [4-chloro-3-(trifluoromethyl)phenyl]amino }-lH-benzimidazol-6- 
yl)oxy]-N-methylpyridine-2-carboxamide (159322) was synthesized as follows: 




Step 1. Synthesis of 4-[(4-anuno-3-nitrophenyl)oxy]-iV-niethylpyridine-2-carboxamide: 
A mixture containing 4-amino-3-nitrophenol (leq) and potassium bis(trimethylsilyl)amide (2eq) 
was stirred in dimethylfdrmamide for 2 hours at room temperature. To this mixture was added 
(4-chloro(2-pyridyl))-iV-methylcarboxamade (leq) and potassium carbonate (1.2eq) and stirred at 
90**C for 3 days. The reaction mixture was then concentrated and partitioned between ethyl 
acetate and water. The organic layer was separated and washed with brine, dried, filtered, and 
concentrated in vacuirai to give brown solid. Purification on silica gel (2% triethyl amine / 50% 
ethyl acetate in hexane) gave 4-[(4-amino-3-nitrophenyl)oxy]-N-methylpyridine-2-carbbxamide . 
as an orange solid. The product gave satisfactory NMR. HPLC, 3.39min; MS: MH*^ = 289. 

Step 2. Synthesis of 4-[(3,4-dianiinophenyI)oxy]-iV-methylpyridine-2-carboxamide: The 
mixture containing [4-(3-amino-4-nitrophenoxy)(2-pyridyl)]-iV- in methanol with catalytic 
amount of 10%Pd/C was hydrogenated until disappearance of the yellow color to yield the 
product amine. HPLC, 2.5nflins; MS: MH^ = 259. 

Step 3. Synthesis of 4-[(2-{[4-chloro-3-(trifiuoromethyl)phenyl]amino}-lH- 
benzimidazol-6-yl)oxy]-N-methylpyridine-2-carboxamide: 

The mixture containing 4-[(3,4-diaminophenyl)oxy]-Ar-methylpyridine-2-carboxamide (leq) and 
4-chloro-3-(trifluoromethyl)benzeneisothiocyanate (leq) in tetrahydrofuran was stirred at room 
temperature for 16 hours to give the corresponding thiourea. To the resulting mixture was added 
l-ethyl-3-(3-dimethylaminopropyl)carbodiimide hydrochloride (2eq) and the mixture was stirred 
for another 10 hours. The mixture was concentrated and partitioned between ethyl acetate and 
water. The organic layer was washed with brine and dried. Purification on HPLC gave 4-[(2- 
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{[4-chloro-3-(trifluoromethyl)phenyl]aniino}-lH-benziniidazol-6-yl)oxy]-N-methy^ 
carboxamide. MS: MH^ =462 

Synthesis of 4-({2-[(4-bromophenyl)amino]-l-methyl- 
lH-benziinidazol-5-vl)oxvVN-methvlpvridine-2-carfaoxamide 
5 The compound 4-({ 2-[(4-bromophenyl)aimno]-l-methyl-lH-benzimidazol-5-yl }oxy)-N- 

methylpyridine-2-caiboxanMde (161651) was synthesized as follows: 




Step 1. Synthesis of 4-{[3-ainino-4-(methylamino)phenyl]oxy}-iV-methylpyridine-2- 
carboxamide: A solution of 4-[(4-amino-3-nitrophenyl)oxy]-iV-methylpyridine-2-carboxamide 

10 (leq) in methylene chloride was treated with trifluoroacetic anhydride (leq) and stirred for 10 
minutes at 0 °C. The mixture was quenched with satd. NaHCOa solution. The organic layer was 
separated and washed with water, brine, dried and evaporated. MS: MEr=385.2 

To a solution of the trifluroacetamide (leq) in a mixture of toluene, acetonitrile and 
sodium hydroxide solution (50%) was added benzyltrimethylammonium chloride (leq) and 

15 dimethyl sulfate (1.2eq). The biphasic mixture was stirred overnight at room temperature and 
evaporated. The mixture was taken up in ethyl acetate, washed with water, brine, dried and 
evaporated. The crude product was purified by column chromatography eluting with 1 : 1 
hexanes and ethylacetate followed by 2% triethylamine in 1 : 1 hexanes and ethyl acetate followed 
by 2% triethylamine in 1:1 hexanes and ethyl acetate to afford iV-methyl-4-{[4-(methylamino)-3- 

20 nitrophenyl]oxy)pyridine-2-carboxamide as a reddish orange solid. MS: Mtt= 303.1. 

The solution of nitromethylaniline in methanol was treated with 5% palladium on carbon 
and stirred under hydrogen atmosphere for 15 min. (until the disappearance of yellow coloration) 
at room temperature. The mixture was filtered and the filtrate was concentrated to provide 0.36 
g of the diamine 4-{[3-amino-4-(methylamino)phenyl]oxy)-iV-methylpyridine-2-carboxamide. 

25 MS: MHr= 273.3. 

Step 2. Synthesis of 4-({2-[(4-bromophenyl)amino]-l-methyl-lH-benzimidazol-5- 
yl}oxy)-N-methylpyridine-2-carboxamide: A solution of the diamine 4-{[3-amino-4- 
(methylamino)phenyl]oxy}-JV-methylpyridine-2-carboxamide (leq) in methanol was treated with 
4-bromophenylisothiocyanate (leq) and stirred at 60 •»€ - 65''C for 2 houis. The reaction 

30 mixture was cooled down to room temperature and methyl iodide (leq) was added and stirred 
overnight at 60"'C. The reaction was cooled to room temperature, evaporated, taken up in ethyl 
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acetate, and washed with water and brine, dried, and evaporated under reduced pressure. 
Column chromatography using a gradient solvent system of hexanes and ethyl acetate and either 
1: 1 methylene chloride and acetone or 5% methanol in methylene chloride yielded the product as 
a half white powder. MS: MH^=452.3 

AMINOBENZIMIDAZOLYLQUINOLINONES 

Compounds of structure I may be synthesized from simple starting molecules as shown in 
Schemes 1-4 and exempUfied in the Examples: As shown in Scheme 1, compounds of structure 
I may generally be prepared using aromatic compounds substituted with amines and carboxylic 
acid groups. 

Schenie 2. 



H 

As shown in Scheme 2, a substituted aromatic compound such as a substituted or 
unsubstimted 2-aminobenzoic acid may be reacted with an acyl halide such as methyl 2- 
(chlorocarbonyl)acetate to produce an amide that will react with a substituted or unsubstituted 
1,2-diaminobenzene. The resulting product is a 4-hydroxy-substituted compound of structure I. 
One skilled in the art wiU recognize that the procedure set forth in Scheme 1 may be modified to 
produce various compounds. 

A method for preparing 4-amino substituted compounds of structure I is shown in 
Scheme 3. As shown in Scheme 3, aromatic compounds substituted with amine and nitrile 
groups may be used to synthesize 4-amino substituted compounds of structure I. A compound 
such as ethyl 2-cyanoacetate may be reacted with ethanol to produce ethyl 3-ethoxy-3- 
iminopropanoate hydrochloride. Subsequent reaction with a substituted or unsubstituted 1,2- 
phenylenediamine provides substituted or unsubstituted ethyl 2-benzimidazol-2-ylacetate. 
Reaction of a substituted or unsubstituted ethyl 2-benzimidazol-2-ylacetate with an aromatic 
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compound having an amine and nitrile group such as substituted or unsubstituted 2- 
aminobenzonitrile with a base such as lithium bis(trimethylsilyl)amide or a Lewis acid such as 
tin tetrachloride provides the substituted or unsubstituted 4-amino substituted compound of 
structure I. 

Scheme 3. HgN,^^^"' 

i? EtOH O NH-HCI HzN"^^ _J^^Y^ 



-CN 



NH2 ^ NH2 
LiHMDS 



SnCU 



Scheme ,4 illustrates a general synthetic route that allows for the synthesis of 4- 
dialkylamino and 4-alkylamino compounds of structure I. An inspection of Scheme 3 shows that 
4-hydroxy substituted compounds of stracture I may be converted into the 4-chloro derivative by 
reaction with phosphorous oxychloride or thionyl chloride. The 4-chloro derivative may then be 
reacted with an alkylamine or dialkylamine to produce the corresponding 4-alkylamino or 4- 
dialkylamino derivative. Deprotection affords the final 4-alkylamino or 4-dialkylamino . 
compounds of structure I. Other groups that may be reacted with the 4-chloK) derivative in this 
manner include, but are not limited to, ROH, RSH, and CuCN. 
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Scheme 4. h,n. 

I 




As shown in Scheme 5, the synthesis of compounds of structure I having a H, alkyl 
group, aryl group, or heterocyclyl group in the 4-position may be accomplished using a 
substituted or unsubstituted 2-benzimidazol-2-ylacetate prepared as shown in Schemes 3 and 4. 




R" = H, alkyl 

5 aryl, heterocyclyl 

TmOSEMCARBAZONES 

General procedure for the preparation of thiosemicarbazones 
Scheme 6 



H " 

A solution of aldehyde (1.0 equiv.) and thiosemicarbazide (1.05 equiv.) in acetic acid was 
stirred overnight. Excess of acetic acid was removed to give a residue, that was washed with 
ethanol, or purified by preparative-HPLC to give the thiosemicarbazone. 
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Scheme 7 

A solution of aldehyde (1.0 equiv.), thiosemicarbazide (1.05 equiv.) and acetic acid (0.1 
equiv.) in methanol was stirred overnight. Nfethanol was removed to give a residue, that was 
worked up as in Scheme 6. 
Scheme 8 

To a solution of {[(lE)-l-aza-2-(4-fluoro-3-nitrophenyl)vinyl]amino}-aminomethane-l- 
thione in ethanol was added an arylamine (2.1 equiv.). The solution was stirred at room- 
temperature until the starting fluoride disappeared. The solution was purified to the product. 




Scheme 9 

A mixture of 4-(diethylamino)-2-hydroxybenzaldehyde (1 equiv.), benzylic bromide (1.2 
equiv.) and powder potassium carbonate in ethanol was stirred at room temperature for 2 days. 
Ethanol was removed, and the residue was dissolved in ethyl acetate and water. The organic 
layer was washed with aqueous NaHCOs and brine, dried over Na2S04., and concentrated. The 
residue was purified on silica gel eluting with ethyl acetate/hexane to give 4-(diethylamino)-2- 
benzoxyhc-benzaldehyde. 

The aldehydes were converted to thiosemicarbazones according to Scheme 7. 

N 

S^N R 



Scheme 10 
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A solution of 3,4-difluorobenzenecarbomtrile (1 equiv.), amine (1.5 equiv.) andDIEA (2 
equiv.) in NMP was heated in a Smith Microwave (Personal Chemistry) for 30 minutes. The 
reaction mixture was purified on silica gel to give 4-substituted 3-fluorobenzenecarbbnitrile. 

To a solution of nitrile in toluene at -78 °C was added DIBAL-H (1 M in toluene, 1.5 
5 equiv.). The reaction mixture was warmed to rt, and stirred for 16 h, and quenched with 
methanol/ethyl acetate/brine (1:1:4). After being stirred at rt for 30 min, the solution was 
extracted with ethyl acetate (3x). The combined organic layers were washed with aqueous 
NaHCOs, brine and concentrated. The aldehyde was purified on silica gel or directly converted 
to thiosemicarbazones (Scheme 7). 
10 Scheme 11 

A solution of 2,4,5-trifluorobenzenecarbonitrile (1 equiv.) and 4-arylpiperazine (1.2 
equiv.) and DIEA (1.2 equiv.) in THF was heated at 80 °C for 2 hours. The mixture was purified 
on silica gel to give 4-substituted 2,5-difluorobenzenecarbonitrile. 
Scheme 12 

15 To an alcohol (1.0 equiv) was added potassium f-butoxide in THF (1 M, 1.1 equiv). 

After 5 minutes, the solution was added to a solution of 4-N-substituted-2,5- 
difluorobenzenecarbonitrile (1 equiv.) in THF. The reaction mixture was stirred at rt overnight 
and quenched with aqueous ammonium chloride. The aqueous layer was extracted with, ethyl 
acetate (3x). The combined organic layers were washed with brine, and concentrated to. give a 

20 residue, that was purified to give 4-N-substituted-2-0-substituted-5-fluorpbenzenecarbomtille. 

. 4-N-substituted-2-0-substituted-5-fluorobenzenecarbonitrile was reduced with DIBAL-H 
to give a 4-N-substituted-2-0-substituted-5-fluorobenzaldehyde according to procedure in 
Scheme 10. 

The aldehyde was converted to the corresponding thiosemicarbazone using Scheme 7. 
15 Scheme 13 

A solution of 4-N-substituted-2,5-difluorobenzenecarbonitiile (1 equiv.), amine (1.5 
equiv.) and DIEA (2 equiv.) in NMP was heated in a Smith Microwave (Personal Chemistry) for 
30 minutes. The reaction mixture was purified on silica gel to give 4-N-substituted-2-N- 
substituted-5-fluorobenzenecarbonitrile. 
50 4-N-substituted-2-N-substituted-5-fiuorobenzenecarbonitrile was reduced with DIBAL-H 

according to procedure described in Scheme 10 to give 4-N-substituted-2-N-substituted-5- 
fluorobenzaldehyde. 

Preparation of amino{ 3-[5-(3-chloiophenyl)(2-furyl)](2-pyrazolinyl) }methane-l-thione 
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To a solution of 5-(3-chlorophenyl)furan-2-carbaldehyde (1.0 equiv.) in TEIF at 0 "C was 
5 added MeMgBr in ether (3.0 equiv.) and stiired for 45 min. The reaction was quenched with 
water, diluted with ether and filtered through Celite. The organic layer was separated and 
washed with brine, dried over MgS04, and concentrated to give the l-[5-(3-chlorophenyl)-2- 
furyl]ethan-l-oi. 

To a solution of secondary alcohol(l .0 equiv.) in CH2CI2 was added MnOz (10 equiv.). 

10 The reaction was stirred overnight, filtered through CeUte, and concenti-ated to give l-[5-(3- 
chlorophenyl)-2-furyl]ethan-l-one. 

To a mixture of ketone (1.0 equiv.), paraformaldehyde (2.0 equiv.), and dimethylamine 
hydrochloride (2.0equiv) and molecular sieves in ethanol was added concentirated hydrochloric 
acid (cat.). The reaction was refluxed overnight under nitirogen and the concentiated. A few 

15 drops of HCl was added, and the mixture was worked up with DCM and water. The organic 
layer was discarded. The aqueous layer was adjusted to basic and extracted with DCM (3x). 
The organic layer was washed with brine, dried over MgS04, and concentirated to yield 3- 
(dimethylamino)-l-[5-(3-chlorophenyl)(2-furyl)]propan-l-one. 

Thiosemicarbazide (1 .0 equiv.) was dissolved in MeOH upon heating under nitirogen. 

20 Aqueous sodium hydroxide (6 M, 9.0 equiv.) was added to the reaction. A methanol solution of 
3-(dimethylamino)-l-[5-(3-chlorophenyl)(2-furyl)]propan-l-one (1.0 equiv) was tiien added 
dropwise to the reaction mixmre. The solvent was removed and the residue was dissolved in 
DCM and washed with water, brine, dried over MgS04, and concenti-ated. The final compound 
was purified by preparative-HPLC to give amino{3-[5-(3-chlorophenyl)(2-furyl)](2- 

25 pyrazoUnyl) }methane-l-thione; LC/MS m/z 306.2 (MH+); Rt =3.06 minutes . 
Scheme 14 




CI 
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To a solution of 4-pyridylmethylaimne (1.0 equiv.) and triettiylainine (2.0 equiv.) in 
CHCI3 was added CS2 (1.0 equiv.)) and stirred overnight. The reaction was cooled to 0 "^C and 
ethyl chloroformate (1.0 equiv.) was added dropwise. The reaction was stirred for 15 min at 0 "C 
and then stirred at room temperature for 2 hrs followed by addition of (tert- 
5 butyl)oxycarbohydrazide (1.2 equiv.). After stirring for an addition hour the mixture was 
washed with aqueous citric acid (5%), saturated NaHCOs, brine, dried over MgS04, and 
concentrated. The desired Boo protected thiosemicarbazide was purified using column - 
chromatography. 

To a solution of Boc protected thiosemicarbazide (1.0 equiv.) dissolved in DCM was 
10 added HCl in dioxane (2M, 8.3 equiv.) and stirrfed for 15 min. MeOH is then added to dissolve 
the precipitate, followed by addition of the furfural, and smaU amount of acetic acid (0.5 mL). 
The mixture is stirred overnight and the solvents are removed to give a residue purified by 
preparative-HPLC to give the thiosemicarbazone. 



Synthesis of 4-[4-(4-methylpiperazin-l-yl)phenoxymethyl]benzaldehyde 




To a solution of 4-piperazin-l-yl phenol (1 equivalent) in CHCI3, cooled to 0 °C, was 
added di-f-butyl dicarbonate (1 equivalent) in CHCI3 drop-wise. The solution was stirred at 0 °C 
for 1 hour before removing from the cold bath and stirring at ambient temperatures for 18 hours. 
20 The organic solution was washed aqueous NaHCOa and brine dried over MgS04 and 
concentrated the crude material was used without purification. 

A solution of the resulting 4-(l-BOC-piperazin-4-yl)phenol (1 equivalent) in dry CH3CN 
was slowly added drop- wise to a slurry of NaH (1 equivalent) in dry CH3CN at room 
temperature under N2. The slurry was stirred at room temperature for 2 hours before the solids 
25 were filtered and washed with EtaO. 

Boc-N^ ^N— ^^~^ONa ^ Boc-N^ 

Sodium 4-(l-BOC-piperazin-4-yl)phenoxide (1 equivalent) and methyl 4- 
bromomethylbenzoate (1 equivalent) were combined in dry acetone and heated to reflux at 60 °C 
for 18 hours. The slurry was filtered and the filtrate was then concentrated to provide the crude 
30 methyl 4-[4-(l-BOC-piperazin-4-yl)phenoxymethyl]benzoate, that was used without 
purification. 
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having sequence identity to an anuno add sequence selected ftom the g,x>up consisting of the 
sequences shown in Figure 127. and in particular SEQ ID NO^: 10506 to 10570. 

•me invention also provides fragments of amino acid sequences encoded by SEQ ID NO- 
10505. The invention also provides fragments of amino acid sequences selected from the g„,up 
consisting of SEQ ID NO- 10506 to 10570. one embodiment, the fragment does not conTisf 
entu^ly of a known amino acid sequence of a SARS virus or a known amino acid sequence of a 
coronavirus. 

m one embidiment. the invention includes a poIypq,tide comprising an amino acid 
^uence from the 53' Frame 3 of Figure 127. Some encoded open reading frames within flris 
translation are: SEQ ID NO: 10533; SEQ ID NO: 10571; SEQ ID NO: 10572; SEQ ID NO- 
10573; SEQ ID NO: 10574. 

•"•^i-vention includes a polypeptide sequence comprising an amino acid sequence 
selectedfromthegroupconsistingofSEQIDNO: 10533, SEQ ID NO: 10571 SEQ ID NO- 
10572. SEQ ID NO: 10573 and SEQ ID NO: 10574. lie invention includes a polypeptide ' 
havmg sequence identity to the amino acid sequence selected from the gtoup consisting of SEQ 
m NO: 10533, SEQ ID NO: 10571, SEQ ID NO: 10572. SEQ © NO: 10573 and SEQ ID NO 
10574, The invention includes a fragment of a polypeptide sequence comprising an amino acid 
sequence selected from the group consisting of SEQ ID NO: 10533, SEQ ID NO 10571 SEO 
ID NO: 10572. SEQ ID NO: 10573 and SEQ ID NO: 10574. 

Partial BLAST results of SEQ ID NO: 10533 against GenBank are given below- 

>ai|7739601 gl)|aAF6a926.1|AF207902 11 ™ ■, . 

hepatitis virus strain ml-Ti} nucleocapsid protein [murine 

Length = 451 

J"""!.'. Expect . 3e-34 

ldent.t,es = 102/252 ,40%,. Positives - 137/252 ,54%,. oaps = 13/252 „», 
Ouer.: 49 ™H^™-^™™so™^^^^^ 

Quer.: 10, -^---Pva«^lv«™.^^ . 
.bjc:t. 123 VF™™o^vCOBlK<^^JoLrKXT«,;v=Sp^L^;;^,,^^ ,3, 
Querx: 167 -=-^«SHS.SC^SSRSSSRS™sTP<.SSRONSP«»«S^^^ 

Sbict: 183 Q«FVVEGSGRSAPASKS<,SRSQSRS™«H««,SS«QHQlL.VKPn>^Ki*L£;^ 242 

S.,=t= 243 K------...U---^oiLIvHjKll.^^-?p^°,^,^^^^^ 

Query: 283 NFGDQDLIRQgt 294 

NFG ++++ GT 
Sbjct: 291 NFGGSEMLKLGT 302 
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10 



30 



>gi|3132999|gb|AAC16422.1| nucleocapsid protein [murine hepatitis virus 
strain 2] 

Length = 451 
Score = 147 bits (370) , Esepect = 3e-34 

Identities = 102/252 (40%), Positives = 137/252 (54%). Gaps = 18/252 (7%) 

SWFTJUiTQHGK-EEIJlFPRGQGVPINTNSGPDDQIGYYRRATRR-VRGGDGKMKELSPRW 106 
SWF+ +3?Q K +E +F +GQGVPI + +Q GY+ R RR + DG+ K+L PRW 



YFYYLGTGP A YG + EG+VWVA++ A + R+P+++ A 



Query: 


49 


Sbjct : 


63 


Query t 


107 


Sbjct: 


123 


Query: 


167 


Sbjct: 


183 


Query: 


227 


Sbjct: 


243 


Query: 


283 


Sbjct: 


291 



+A L+L +Ij 



SGKGQQQQGQTVTKKSAAEASK KPRQKRTATKQYNVTQAFGRRGPEQTQG 282 

GQ +Q VTK+SA E + KPRQKRT KQ V Q FG+RGP Q 
-DAGQPKQ VTKQSAKEVRQKILTKPRQKRTPNKQCPVQQCFGKRGPNQ 290 



>gi|l27877|sp|p03417|NCAP_CVMJH Nucleocapsid protein 
gi|74859|pirl JVHIHMJ nucleocapsid protein - murine hepatitis virus 

(strain JHM) 

gi|58973|enib|CAA25497.l| nucleocapsid protein [Murine hepatitis virus] 
Length = 455 

Score = 146 bits (369) , Expect = 4e-34 

Identities = 110/254 (43%) , Positives = 142/254 (55%), Gaps = 22/254 (8%) 

SWFTALTQHGK-EELRFPRGQGVPINTOSGPDDQIGYYRRATRR-VRGGDGKMKELS PRW 106 
SWF+ +TQ K +E +F +GQGVPI Q GY+ R RR + DG+ K+L PRW 



YFYYLGTGP A YG + EG+VWVA++ A I R+P+++ A 



50 +GFY EGS G S +SRS SR SRG 



SGKGQQQQGQTVTKKSAAEASK KPRQKRTATKQYNVTQAFGRRGPEQT 280 

GQ +Q VTK+SA E + KPRQKRT KQ V Q FG+RGP Q 
-DAGQPKQ VTKQSAKEVRQKILNKPRQKRTPNKQCPVQQCFGKRGPNQ- 294 



Query: 


49 


Sbjct: 


67 


Query; 


107 


Sbjct: 


127 


Query: 


167 


Sbjct: 


187 


Query: 


225 


Sbjct: 


245 


Query: 


281 


Sbjct: 


295 



NFG ++++ GT 
-NFGGPEMLKLGT 306 
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>gi|6625766|gb|AAF19389.1|AF201929_7 nucleocapsid protein [murine 

hepatitis vianis strain 2] 

gi)7769348|gb|JVAF69338.l|AF208066_ll nucleocapsid protein [murine 

hepatitis virus] 

Length =451 

Score = 146 bits (368), Expect = 5e-34 

Identities = 102/252 (40%), Positives = 137/252 (54%), Gaps = 18/252 (7%) 

SWFTi^TQHGK-EELRFPRGQGVPINTNSGPDDQIGYYIUlATRR-VRGGDGKMKELSPRW 106 
SWF+ +TQ K +E +F +GQGVPI + +Q GY+ R RR + DG+ K+L PRW - 

SWFSGITQFQKGKEFQFAQGQGVPIASGIPASEQKGYWYRHNRRSFKTPDGQHKQLLPRW 122 

YFYYLGTGPEASLPYGANKEGIVWVATEGAIJJTPKDHIGTRNPNNNI^ 166 
YFYYIiGTGP A YG + EG+VWVA++ A + R+P+++ A + GT LP 



+A L+L +L 



Query: 


49 


Sbjct : 


63 


Query: 


107 


Sbjct: 


123 


Query: 


167 


Sbjct: 


183 


Query: 


227 


Sbjct: 


243 


Query: 


283 


Sbjct: 


291 



IKGQQQQGQTVTKKSAAEASK KPRQKRTATKQYNVTQAFGRRGPEQTQG 282 

GQ +Q VTK+SA E + KPRQKRT KQ V Q FG+RGP Q 
>AGQPKQ VTKQSAKEVRQKILTKPRQKRTPNKQCPVQQCFGKRGPNQ- — 290- 



NFG ++++ GT 



>gi 1 21734854 | gb|AAM77005 . 1 1 AF481863_7 phosphorylated nucleocapsid ■• protein 
N [porcine hemagglutinating encephalomyelitis virus] 

Length =449 . 

Score = 145 bits (366), Expect = 8e-34 

Identities = 107/253 (42%), Positives = 145/253 (57%), Gaps = 18/253 (7%) ' 

Query: 49 SWFTALTQHGK-EELRFPRGQGVPINTNSGPDDQIGYYRRATRR-VRGGDGKMKELSPRW 106 

SWF+ +TQ K +E F GQGVPI + GY+ R RR + DG ++L PRW 

Sbjct: 64 SWFSGITQPQKGKEPEFAEGQGVPIAPGVPATEAKGYWYRHNRRSFKTADGNQRQLLPRW 123 

Query: 107 YFYYLGTGPEASLPYGANKEGTVWVATEGA-LNTPKDHIGTRNPNNNAATVLQLPQGTTL 165 

YFYYLGTGP A YG + +G+ WVA+ A +NTP D I R+P+++ A + P GT L 
Sbjct: 124 YFYYLGTGPHAKHQYGTDIDGVFWVASNQADINTPAD-rVDRDPSSDEAIPTRFPPGTVL 182 

Query: 166 PKGFYAEGSRGGSQASSRSSSRSRGNSRNSTPGSSRGNSPARMASGGGETALALLLLDRL 225 

P+G+Y EGS G S +SRS+SR+ N S SR NS R ++ G +A D++ 
Sbjct: 183 PQGYYIEGS-GRSAPNSRSTSRA-PNRAPSAGSRSRANSGNRTSTPGVTPDMA DQI 236 

Query: 226 NQLESKVSGKGQQQQGQTVTKKSAAEASK KPRQKRTATKQYNVTQAFGRRGPEQTQ 281 

L GK + Q VTK++A E + KPRQKR+ KQ V Q FG+RGP Q 

Sbjct: 237 ASLVLAKLGK-DATKPQQVTKQTAKEVRQKILNKPRQKRSPNKQCTVQQCFGKRGPNQ — 293 

Query: 282 GNFGDQDLIRQGT 294 

NFG ++++ GT 
Sbjct: 294 -NFGGGEMLKLGT 305 
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>gi|23295765 |gb|AAL80036.1| nucleocapsid protein [porcine 

hemagglutinating encephalomyelitis virus] 
Length = 449 

Score = 145 bits (365) , Expect = le-33 

Identities = 107/253 (42%), Positives = 145/253 (57%), Gaps = 18/253 (7%) 



Query: 


49 


SWFTALTQHGK-EEI^PRGQGVPINTNSGPDDQIGYYRRATRR-VRGGDGKMKELSPRW 


106 






SWP+ +TQ K +E P GQGVPI + 6Y+ R RR + DG PRW 




Sbjct: 


64 


SWFSGITQFQKGKEFEPAEGQGVPIAPGVPSTEAKGYWYRHNRRSFKTADGSNQRQLLPRW 


123 


Query: 


107 


YFYYIiGTGPEASLPYGANKEGIVWVATEGA-IjNTPKDHIGTRNPNNNAAT^ 


165 






YFYYLGTGP A YG + +G+ WVA+ A +NTP D I R+P+++ A + P GT Lt 




Sbjct: 


124 


YFYYLGTGPHAKDQYGTDIIXSVFWVASNQADIlSrrPAD-IVDRDPSSDEAIPTRFPPGTVIi 


182 


Query: 


166 


PKGFYAEGSRGGSQASSRSSSRSRGNSRNSTPGSSRGNSPARMASGGGETAIJ^LLLLDRI. 


225 






P+G+Y EGS G S +SRS+SR+ N S • SR NS ' R ++ G +A D++ 




Sbjct: 


183 


PQGYYIEGS-GRSAPNSRSTSRA-PNRAPSAGSRSRANSOORTSTPGVTPDMA DQI 


236 


Query: 


226 


NQLESKVSGKGQQQQGQTVTKKSAAEASK KPRQKRTATKQYNVTQAFGRRGPEQTQ 


281 






L GK + Q VTK++A E + KPRQKR+ KQ V Q FG+RGP Q 




Sbjct: 


237 


ASLVXjAKLGK-DATKPQQVTKQTAKBVRQKIIjNKPRQKRSPNKQCTVQQCFGKRGPNQ-- 


293 


Query: 


282 


GJJFGDQDLIRQGT 294 








NFG ++++ GT 




Sbjct: 


294 


-NFGGGEMLKLGT 305 





These results indicate that SEQ ID NO: 10533 has functional similarities to a coronavirus 
nucleocapsid protein. 

In one embodiment, the invention comprises an amino acid sequence from the 5'3' Frame 1 
of Figure 127 e.g. SEQ ID NO^: 10506-10514. Some encoded open reading frames within this 
region are SEQ ID NO^: 10575 to 10578. 

Accordingly, the invention includes a polypeptide comprising the amino acid sequence 
selected from the group consisting of SEQ ID NO: 10575, SEQ ID NO: 10576, SEQ ID NO: 
10577 and SEQ ID NO: 10578. The invention includes a polypeptide comprising an amino acid 
sequence having sequence identity to a sequence selected from the group consisting of SEQ ID 
NO: 10097, SEQ ID NO: 10576, SEQ ID NO: 10577 and SEQ ID NO: 10578. The invention 
includes a fragment of a polypeptide comprising an amino acid sequence selected from the group 
consisting of SEQ ID NO: 10097, SEQ ID NO: 10576, SEQ ID NO: 10577 and SEQ ID NO: , 
10578. 

In one embodiment, the invention includes a polypeptide comprising an amino acid 
sequence from the 3'5' Frame 2 of Figure 127 e.g. SEQ ID NO^: 10547-10559. An open reading 
frame within this region is SEQ ID NO: 10579. 

The invention includes a polypeptide comprising an amino acid sequence of SEQ ID NO: 
10579. The invention includes a polypeptide comprising an amino acid sequence having 
sequence identity to SEQ ID NO: 10579. The invention includes a fragment of a polypeptide 
comprising an amino acid sequence of SEQ ID NO: 10579. 
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The invention also includes polynucleotide sequences which can be used as probes for 
diagnostic reagents, kits (comprising such reagents) and metiiods which can be used to diagnose 
or identify the presence or absence of a SARS vims in a biological sample. The invention 
includes a polynucleotide sequence comprising one or more of the primer sequences identified in 
Table 33. The invention further includes polynucleotide sequence comprising the complement 
of one or more of the primer sequences identified in Table 33. 

The invention includes a polynucleotide sequence comprising SEQ ID NO: 11323. A 
polypeptide encoded by SEQ ID NO: 1 1323 is SEQ ID NO: 1 1324. 

The invention includes a polypeptide comprising SEQ ID NO: 1 1324, sequence having 
sequence identity to SEQ ID NO: 11324 andfiragments of SEQ ID NO: 11324. The invention 
includes a fragment of SEQ ID NO: 11324, wherein said polypeptide fragment begins with a 
Methionine. 

Accordingly, the invention includes a polynucleotide sequence comprising SEQ BO NO: 
11323. It also provides polynucleotide sequences having sequence identity to SEQ ID NO: 
15 1 1323. The invention also provides for polynucleotide sequences comprising fragments of SEQ 
ID NO: 1 1323. In one embodiment, tiie polynucleotide fragment does not consist entirely of a 
known SARS polynucleotide sequence or a known coronavirus polynucleotide sequence. 

The invention includes an amino acid sequence encoded by the polynucleotide sequence • 
SEQ ID NO: 1 1323, including the ainino acid sequence of SEQ ID NO: 1 1324. 
20 The invention also provides amino acid sequences having sequence identity to an amino 

acid sequence encoded by SEQ ID NO: 11323. The invention provides amino acid sequences 
having sequence identity to SEQ ID NO: 1 1324. 

The invention provides fragments of amino acid sequences encoded by SEQ ID NO: 
11323. The invention also provides fragments of amino acid sequences of SEQ ID NO: 11324. 
25 In one embodiment, tiie fragment does not consist entirely of a known SARS amino acid 
sequence or a known coronavirus ainino acid sequence. 

The invention also includes polynucleotide sequences which can be used as probes for 
diagnostic reagents, kits (comprising such reagents) and methods which can be used to diagnose 
or identify tfie presence or absence of a SARS virus in a biological sample. The invention 
30 includes a polynucleotide sequence comprising one or more of the primer sequences identified as 
SEQ ID NO^: 11325-11440 (left part) and SEQ ID NO^: 11441-11551 (right part). The invention 
further includes polynucleotide sequence comprising the complement of one or more of the 
primer sequences identified as SEQ ID NO^: 11325-11551. 

The invention includes a polypeptide comprising SEQ ID NO: 11552. The SARS virus 
contains polymorphism at die Isoleucine residue ne-324. The invention includes a polypeptide 
comprising an amino acid sequence having sequence identity to SEQ ID NO: 1 1552, wherein 
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said polypeptide includes an amino acid sequence selected from the group consisting of YSYAI 
(SEQ ID NO: 11553), SYAIH (SEQ ID NO: 11554), YAIHH (SEQ ID NO: 11555), ihhdk 
(SEQ ID NO: 11556). SYAI (SEQ ID NO: 11557). YAIH (SEQ ID NO: 11558). AIHH (SEQ ID 
NO: 11559). IHHD (SEQ ID NO: 11560), YAI, AIH. and IHH. The invention includes a 
5 fragment of a polypeptide comprising SEQ ID NO: 1 1552, wherein said fragment includes an 
amino acid sequence selected from the group consisting of YSYAI (SEQ ID NO: 11553), 
SYAIH (SEQ ID NO: 1 1554), YAIHH (SEQ ID NO: 1 1555). IHHDK (SEQ ID NO: 1 1556), 
SYAI (SEQ ID NO: 11557), YAIH (SEQ ID NO: 11558), AIHH (SEQ ID NO: 11559), IHHD 
(SEQ ID NO: 11560). YAI, AIH. and IHH. 

10 The invention includes a polypeptide comprising an amino acid sequence selected from the 

group consisting of SEQ ID NO: 11561 and SEQ ID NO: 11562. The invention includes a 
fragment of a polypeptide comprising an amino acid sequence selected from the group consisting 
of SEQ ID NO: 11561 and SEQ ID NO: 11562. 

The invention includes a diagnostic kit comprising a polypeptide comprising at least one of 

15 the amino acid sequences selected from the group consisting of SEQ ID NO®: 1 1561 and 1 1562. 
The invention includes a diagnostic kit comprising a polynucleotide sequence encoding a 
polypeptide comprising at least one of the amino acid sequences selected from the group 
consisting of SEQ ID NO®: 11561 and 11562. The invention includes an immunogenic 
composition comprising a polypeptide comprising at least one of the amino acid sequences 

:0 selected from the group consisting of SEQ ID NO®: 11561 and 11562. The invention includes an 
antibody which recognizes a polypeptide comprising at least one of tiie amino acid sequences 
selected from the group consisting of SEQ ID NO®: 1 1561 and 1 1562. 

The invention includes a polynucleotide sequence SEQ ID NO: 1 1563 or a fragment 
thereof or a sequence having sequence identity thereto. Polypeptide sequences which can be 

5 tiranslated from SEQ ID NO: 1 1563 are shown in Figure 128. The constituent amino acid 

sequences from Figure 128, having at least 4 amino acids, are listed as SEQ ID NO®: 1 1564 to 
11617. 

The invention includes a polypeptide sequence selected from the group consisting of the 
sequences of Figure 128, or a fragment thereof or a sequence having sequence identity thereto 
3 e.^. SEQ ID NO®: 11563 to 11617. 

A polypeptide sequence witiiin SEQ ID NO: 11600 is SEQ ID NO: 11618. The invention 
includes a polypeptide comprising SEQ ID NO: 11618, or a fragment thereof or a sequence 
having sequence identity thereto. 
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A polypeptide sequence within SEQ ID NO: 11602 is SEQ ID NO: 11641. The invention 
includes a polypeptide comprising SEQ ID NO: 1 1641. or a fragment thereof or a sequence 
having sequence identity thereto. 

A polypeptide sequence within SEQ ID NO: 1 1609 is SEQ ID NO: 11619. 

The invention includes a polynucleotide encoding (i) an amino acid sequence selected from 
the group consisting of: (1) the amino acid sequences of Figure 128, and in particular SEQ ID 
NO^: 11564-11617: (2) SEQ ID NO: 11618; and (3) SEQ ID NO: 11619. or (ii) a fragment 
thereof. The invention includes a diagnostic kit comprising a one or more of these proteins. The 
invention includes a diagnostic kit comprising a polynucleotide sequence encoding one or more 
of these polypeptide sequences. The invention includes an antibody which recognizes one or 
more of the polypeptide sequences. 

The SARS virus may contain polymorphism at isoleucine residue Ile-326 in SEQ ID NO: 
11620 (Chi-PEP3). The invention includes a polypeptide comprising an amino acid sequence 
having sequence identity to SEQ ID NO: 11620, wherein said polypeptide includes an amino 
acid sequence selected from the group consisting of YAIHH (SEQ ID NO: 1 1621) and YATHH 
(SEQ ID NO: 1 1622). The invention includes a fragment of a polypeptide comprising SEQ ID 
NO: 11620, whereiii said fragment includes an amino acid sequence selected from the group 
consisting of YAIHH (SEQ ID NO: 1 1621) and YATHH (SEQ ID NO: 1 1622). 

, The SARS virus may contain polymorphism at glutamine residue Gln-830 in SEQ ID NO: 
11620. The invention includes a polypeptide comprising an amino acid sequence having 
sequence identity to SEQ ID NO: 1 1620. wherein said polypeptide includes an amino acid 
sequence selected from tiie group consisting of ASQAW (SEQ ID NO: 11623) and ASRAW (SEQ 
ID NO: 11624). The invention includes a fragment of a polypeptide comprising SEQ ID NO: 
1 1 620, wherein said fragment includes an amino acid sequence selected from the group 
consisting of ASgAW (SEQ ID NO: 11623) and ASRAW (SEQ ID NO: 11624). 

The SARS virus may contain polymorphism at aspartic acid residue Asp-935 in SEQ ID 
NO: 11620. The invention includes a polypeptide comprising an amino acid sequence having 
sequence identity to SEQ ID NO: 11620, wherein said polypeptide includes an amino acid 
sequence selected from the group consisting of DADST (SEQ ID NO: 1 1625) and DAYST (SEQ 
ID NO: 1 1626). The invention includes a fragment of a polypeptide comprising SEQ ID NO: 
11620. wherein said fragment includes an amino acid sequence selected from the group 
consisting of DADST (SEQ ID NO: 11625) and DAYST (SEQ ID NO: 11626). 

The SARS virus may contain polymorphism at serine residue Ser-577 in SEQ ID NO: 
11627 (Chi-PEP4). The invention includes a polypeptide comprising an amino acid sequence 
having sequence identity to SEQ ID NO: 11627, wherein said polypeptide includes an amino 
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acid sequence selected from the group consisting of PCSFG (SEQ ID NO: 11628) and pcafg 
(SEQ ID NO: 1 1629). The invention includes a fragment of a polypeptide comprising SEQ ID 
NO: 11627, wherein said fragment includes an amino acid sequence selected from the gioup 
consisting of PCSFG (SEQ ID NO: 11628) and PCAFG (SEQ ID NO: 11629). 

The SARS virus may contain polymorphism at valine residue Val-68 in SEQ ID NO: 
11630 (Chi-PEP8). The invention includes a polypeptide comprising an amino acid sequence 
having sequence identity to SEQ ID NO: 1 1630, wherein said polypeptide includes an amino 
acid sequence selected from the group consisting of LAVVY (SEQ ID NO: 11631) and LAAVY 
(SEQ ID NO: 11632). The invention includes a fragment of a polypeptide comprising SEQ ID 
NO: 1 1630, wherein said fragment includes an amino acid sequence selected from the group 
consistfng of LAVVY (SEQ ID NO: 11631) and LAAVY (SEQ ID NO: 11632). 

The SARS virus may contain polymorphism at isoleucine residue Ile-50 in SEQ ID NO: 
11633 (Chi-PEP13). The invention includes a polypeptide comprising an amino acid sequence 
having sequence identity to SEQ ID NO: 11633, wherein said polypeptide includes an amino 
acid sequence selected from the group consisting of NNXAS (SEQ ID NO: 11634) and NNTAS 
(SEQ ID NO: 11635). The invention includes a fragment of a polypeptide comprising SEQ ID 
NO: 11633, whierein said fragment includes an amino acid sequence selected from tiie group 
consisting of NNXAS (SEQ ID NO: 1 1634) and NNTAS (SEQ ID NO: 1 1635). 

The SARS vims may contain a polymorphism at Serine residue Ser-943 in SEQ ID NO: 
11636. The invention includes a polypeptide comprising an amino acid sequence having 
sequence identity to SEQ ID NO: 11636, wherein said polypeptide includes an amino acid 
sequence selected from the group consisting of AVSAC (SEQ ID NO: 11637) and AV6AG (SEQ 
ID NO: 1 1638). The invention includes a fragment of a polypeptide comprising SEQ ID NO: 
1 1636, wherein said fragment includes an amino acid seuence selected from the group consisting 
of AVSAC (SEQ ID NO: 1 1637) and AVGAC (SEQ ID NO: 1 1638). 

The invention includes a polynucleotide SEQ ID NO: 11639, or a fragment thereof or a 
sequence having sequence identity thereto. The invention includes a polypeptide encoded by the 
polynucleotide sequence set forth in SEQ ID NO: 1 1639, or a fragment thereof or a polypeptide 
sequence having sequence identity thereto. 

The invention includes a polynucleotide set forth in SEQ ID NO: 1 1640, or a fragment 
thereof or a sequence having sequence identity thereto. The invention includes a polypeptide 
encoded by the polynucleotide sequence set forth in SEQ ID NO: 11640, or a fragment thereof or 
a polypeptide sequence having sequence identity thereto. 

The invention includes each of the polynucleotides identified above. The invention 
includes each of the polynucleotides set forth in the sequence listing. The invention further 
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includes polynucleotides having sequence identity to each of the polynucleotides identified 
above. The degree of sequence identity is preferably greater than 50% (e.g., 60%, 61%, 62%, 
63%, 64%, 65%, 66%, 67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 
79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 
95%. 96%, 97%, 98%. 99% or more). 

The invention includes polynucleotide sequences comprising fragments of each of the 
polynucleotide sequences identified above. The fragments should comprise at least n 
consecutive polynucleotides from a particular SEQ ID NO:, and, depending on the sequence, n is 
7 or more (e.g., 7. 8. 9, 10, 11. 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 
29, 30, 31. 32, 33. 34. 35, 36, 37, 38, 39. 40, 45, 50, 55, 60, 65, 70, 80, 90, 100, 120, 130, 14o'. 
. 150, 160, 170, 180, 190, 200, 210, 220, 230, 240, 250, 260, 270, 280, 290, 300 or more). 

The invention includes each of the aminp acid sequences encoded by each of the 
polynucleotide sequences identified above. The invention includes each of the amino acid 
sequences encoded by each of the polynucleotide sequences set forth in the sequence Usting. 
The invention further includes amino acid sequences having sequence identity to the amino acid 
sequences encoded by each of the polynucleotide sequences identified above. The degree of 
sequence identity is preferably greater than 50% (e.g. , 60%, 61 %, 62%. 63%, 64%, 65%, 66%, 
67%, 68%, 69%, 70%, 71%. 72%, 73%, 74%, 75%. 76%, 77%, 78%. 79%. 80%, 81%, 82%, 
83%, 84%, 85%. 86%, 87%. 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 
99% or more). The invention further mcludes fragments of amino acid sequences encoded by 
each of the polynucleotide sequences identified above. The fragments should comprise at least n 
consecutive amino acids from a particular SEQ ID NO:, and, depending on the sequence, n is 7 
or more (e.g., 7, 8, 9, 10, 11, 12, 13, 14, 15. 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26. 27. 28. 29. 
30. 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 45, 50, 55, 60, 65, 70, 80, 90, 100, 120. 130, 140, 150,' 
160, 170, 180, 190, 200, 210, 220, 230, 240, 250, 260, 270, 280, 290, 300 or more). 

The invention includes each of the amino acid sequences identified above. The invention 
includes each of the amino acid sequence set forth in the sequence listing. The invention further 
includes amino acid sequences having sequence identity to each of the amino acid sequences 
identified above. The degree of sequence identity is preferably greater than 50% (e.g., 60%, 
61%, 62%, 63%, 64%. 65%, 66%, 67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 
77%. 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%. 89%, 90%, 91%. 92%, 
93%, 94%. 95%, 96%, 97%, 98%, 99% or more). 

The invention further includes fragments of the amino acid sequences identified above. 
The fragments should comprise at least n consecutive amino acids from a particular SEQ ID 
NO:, and, depending on the sequence, n is 7 or more (e.g., 7, 8, 9. 10. 11, 12, 13, 14, 15, 16, 17, 
18, 19, 20. 21, 22, 23, 24, 25. 26, 27. 28. 29. 30. 31, 32. 33. 34. 35. 36. 37. 38. 39, 40. 45, 50. 55. 
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60, 65, 70, 80, 90, 100, 120, 130, 140, 150, 160, 170, 180. 190, 200, 210, 220, 230, 240, 250, 
260, 270, 280, 290, 300 or marc). 

The invention includes polynucleotides encoding each of the amino acid sequences 
identified above. The invention includes polynucleotides encoding each of the amino acid 
sequences set forth in the sequence listing. The invention further includes polynucleotides 
having sequence identity with each of the polynucleotides encoding each of the amino acid 
sequences identified above. The degree of sequence identity is preferably greater than 50% (eg., 
60%, 61%, 62%, 63%, 64%, 65%, 66%, 67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, 75%, 
76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 
92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more). 

The invention further includes fragments of polynucleotides encoding each of the amino 
acid sequences identified above. The fragments should comprise at least n consecutive 
polynucleotides from a particular SEQ ID NO:, and, depending on the sequence, n is 7 or more 
(e.g., 7, 8, 9, 10, 11, 12, 13, 14, 15, 1.6_17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27. 28, 29, 30, 31, 
32, 33, 34, 35, 36, 37, 38, 39, 40, 45, 50, 55, 60, 65, 70, 80, 90, 100, 120, 130, 140, 150, 160, 
170, 180, 190, 200, 210, 220, 230, 240, 250, 260, 270, 280, 290, 300 or more). 

As described in more detail below, polynucleotides for use as primers and/or as probes 
may contain at least 4 or 8 contiguous nucleotides from a polynucleotide sequence of the 
invention e.g. at least 14, 15, 16, 17, 18. 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, or 30 
contiguous nucleotides and up to about 50, 75, 100, 200 contiguous nucleotides or more. While 
6-8 nucleotides may be a workable length, sequences of 10-12 nucleotides are preferred, and 
about 13, 14, 15, 16, 17, 18, 19, 20, or 21 or more nucleotides or more appears optimal for 
hybridisation. 

Li one embodiment, the invention is directed to polynucleotides and amino acid sequences • 
that do not consist entirely of a known SARS virus polynucleotide or amino acid sequenbe or of 
a known coronavirus polynucleotide or amino acid sequence. In one embodiment, the 
polynucleotides and amino acid sequences of the invention do not consist entirely of the 
sequence SEQ ID NO: 1. In anotiier embodiment, the polynucleotides and amino acid sequences 
of the invention do not consist entirely of the sequence SEQ ID NO: 2. SEQ ID NO: 9967 is a 
SARS genome sequence of the Frankfurt (FRA) isolate (GenBank: AY3 10120). Compared to 
SEQ ID NO: 1, it differs at nucleotides 2546, 2590, 11437, 18954, 19073, 20585, 20899, 23209, 
24922, 26589 & 28257; compared to SEQ ID NO:2, it differs at nucleotides 2560, 7922, 11451, 
16625, 18968 & 19067. Further genome sequences have become available from GenBank, since 
this application was originally filed, under accession numbers including AY559097, AY559096, 
AY559095, AY559094, AY559093, AY559092, AY559091, AY559090, AY559089, AY559088, 
AY559087. AY559086, AY559085, AY559084, AY559083, AY559082, AY559081, AY274119. 
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AY323977, AY291315, AY502932, AY502931, AY502930, AY502929, AY502928, AY502927, 
AY502926, AY502925, AY502924, AY502923, AY291451, AY390556, AY395003, AY395002. 
AY395001, AY395000, AY394999, AY394998, AY394997, AY394996, AY394995, AY394994, 
AY394993, AY394992, AY394991, AY394990, AY394989, AY394987, AY394986, AY394985, 
5 AY394983, AY394979. AY394978, AY508724, AY394850, AY463059, AY463060, AY313906, 
AY310120, AY461660, AY485278, AY485277, AY345988, AY345987, AY345986, AY282752, 
AY357076, AY357075, AY350750, AY304495, AY304488, AY304486, AY427439, AY283798, 
AY278491, AY278489, AY362699, AY362698, AY283797, AY283796, AY283795, AY283794^ 
AY278741, AY351680, AP006561. AP006560, AP006559, AP006558, AP006557, AY278554, 

iO AY348314, AY338175. AY338174, AY321118, AY279354. AY278490, AY278487. AY297028, 
AY278488, and NC_004718. 

In another embodiment, the invention is directed to polynucleotides that encode proteins 
which are not immunologically cross reactive with a protein of a mouse hepatitis virus, a bovine 
coronavirus or an avian infectious bronchitis vims. In another embodiment, the invention is 

15 directed to proteins which are not immunologically cross reactive with a protein of a mouse 
hepatitis virus, a bovine coronavirus or an avian infectious bronchitis virus. 

Each of the polynucleotides identified above may be used to encode a portion of a fusion 
protein. Accordingly, the invention complies one or more of the polynucleotides identified 
above wherein the polynucleotides encoding for the start codon are removed. The invention 

20 further comprises one or more of the amino acids identified above wherein the starting 

•; methionine is removed. 

Any of the polynucleotide or amino acid sequences discussed above may be used in 
vaccines for the tireatinent or prevention of SARS virus infection, including as a SARS viral 
antigen. Additionally, any of the polynucleotides or amino acid sequences discussed above may 

25 be used as diagnostic reagents, or in kits (comprising such reagents) or in methods used to 
diagnose or identify the presence or absence of a SARS virus in a biological sample. 

SARS viral antigens of the invention noay include a polypeptide vkdth 99%, 95%, 90%, 
85%, or 80% homology to one or more of the group consisting of the following proteins: 
nonstiiictural protein 2 (NS2); hemagglutinin-esterase glycoprotein (HE) (also referred to as E3), 

30 spike glycoprotein (S) (also referred to as E2), nonstructural region 4 (NS4), envelope (small 
membrane) protein (E) (also referred to as sM), membrane glycoprotein (M) (also referred to as 
El), nucleocapsid phosphoprotein (N) or RNA dependent RNA polymerase (pol). 

A detailed discussion of Coroavirus biology can be found in Fields Virology (2nd ed), 
Fields et al. (eds.), B.N. Raven Press, New York, NY., Chapter 35. 

35 Anotfier example of a SARS virus isolate is set forth in Example 1 below. The invention 

includes each of the polypeptide and polynucleotide sequences identified in Example 1. In 
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addition, the invention includes vaccine formulations comprising one or more of the polypeptide 
or polynucleotide sequences identified in Example 1. Thfe invention includes diagnostic 
legaents, Mts (comprising such reagents) and methods which can be used to diagnose or identify 
the presence or absence of a SARS virus in a biological sample using one or more of the 
5 polypeptide or polynucleotide sequences identified in Example 1 . The invention includes 

methods for the treatment or prevention of SARS virus infection utilizing small molecule viral 
inhibitors and combinations of small molecule viral inhibitors and kits for the treatment of 
SARS. The small molecule inhibitors may specifically target one or more of the polypeptides or 
polynucleotides identified in Example 1. 

10 Further discussion of terms used in the application follows below. 

*'Respiratory Virus" as used herein refers to a virus capable of infecting the human 
respiratory tract. Respiratory Viral Antigens suitable for use in the invention include Severe 
Acute Respiratory Syndrome virus, coronavirus, influenza virus, human rhinovirus (HRV), 
parainfluenza virus (PIV), respiratory syncytial virus (RSV), adenovirus, metapneumovirus, and 

15 riiinovirus. 

The terms "polypeptide", "protein" and "amino acid sequence" as used herein generally 
refer to a polymer of amino acid residues and are not limited to a minimum length of the product. 
Thus, peptides, oligopeptides.jdimers, mulimers, and the Uke, are included within the definition. 
Both full-length proteins and fi:agments thereof are encompassed by the definition. Minimum 

20 fragments of polypeptides useftil in the invention can be at least 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 
14, or even 15 amino acids. Typically, polypeptides useful in this invention can have a 
maximum length suitable for the intended application. Generally, the maximum length is not 
critical and can easily be selected by one skilled in the art. 

Polypeptides of the invention can be prepared in many ways e.g. by chemical synthesis (at 

25 least in part), by digesting longer polypeptides using proteases, by translation from RNA, by 
purification from cell culture {e.g. from recombinant expression), from the organism itself (e.g. 
after viral culture, or direct from patients), from a cell line souice etc. A preferred method for 
production of peptides <40 amino acids long involves in vitro chemical synthesis (Bodanszky 
(1993) Principles of Peptide Synthesis (ISBN: 0387564314); Fields et al. (1997) Methods in 

30 Enzymology 289: Solid-Phase Peptide Synthesis. ISBN: 0121 821900). Solid-phase peptide 
synthesis is particularly preferred, such as methods based on t-Boc or Fmoc (Chan & White 
(2000) Fnioc Solid Phase Peptide Synthesis ISBN: 0199637245) chemistry. Enzymatic synthesis 
(KuUmann (1987) Enzymatic Peptide Synthesis. ISBN: 0849368413) may also be used in part or 
in full. As an alternative to chemical synthesis, biological synthesis may be used e.g. the 

J5 polypeptides may be produced by translation. This may be carried out in vitro or in vivo. 

Biological methods are in general restricted to the production of polypeptides based on L-amino 
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acids, but manipulation of translation machinery ie.g. of aminoacyl tRNA molecules) can be 
used to allow the introduction of D-amino acids (or of other non natural amino acids, such as 
iodotyrosine ormethylphenylalanine, azidohomoalanine, etc.) (Ibba (1996) Biotechnol Genet 
Eng Rev 13:197-216.). Where D-amino acids are included, however, it is preferred to use 
chemical synthesis. Polypeptides of the invention may have covalent modifications at the 
C-terminus and/or N-terminus, particularly where they are for in vivo administration e.g hy 
attachment of acetyl or carboxamide, as in the Fuzeon™ product. 

Reference to polypeptides and the like also includes derivatives of the amino acid 
sequences of the invention. Such derivatives can include postexpression modifications of the 
polypeptide, for example, glycosylation, acetylation, phosphorylation, and the like. Amino acid 
derivatives can also include modifications to the native sequence, such as deletions, additions 
and substitutions (generally conservative in nature), so long as the protein maintains the desired 
activity. These modifications may be deUberate, as through site-directed mutagenesis, or may be 
accidental, such as through mutations of hosts which produce the proteins or errors due to PGR 
amplification. Furthermore, modifications may be made that have one or more of the following 
effects: reducing toxicity; facilitating cell processing {e.g., secretion, antigen presentation, etc.); 
and facilitating presentation to B-cells and/or T-cells. 

"Fragment" or "Portion" as used herein refers to a polypeptide consisting of only a part of 
the intact full-length polypeptide sequence and structure as found in nature. For instance, a 
fragment can include a C-tenninal deletion and/or an N-terminal deletion of a protein. 

A "recombinant" protein is a protein which has been prepared by recombinant DNA 
techniques as described herein. In general, the gene of interest is cloned and then expressed in 
transformed organisms, as described further below. The host organism expressed the foreign 
gene to produce the protein under expression conditions. 

The term "polynucleotide", as known in the art, generally refers to a nucleic acid molecule. 
A "polynucleotide" can include both double- and single-stranded sequences and refers to, but is 
not limited to, cDNA firom viral, prokaryotic or eukaryotic mRNA, genomic RNA and DNA 
sequences fixjm viral (e.g. RNA and DNA viruses and retroviruses) or prokaryotic DNA, and 
especially synthetic DNA sequences. The term also captures sequences that include any of the 
known base analogs of DNA and RNA, and includes modifications such as deletions, additions 
and substitutions (generally conservative in nature), to the native sequence, so long as the nucleic 
acid molecule encodes a therapeutic or antigenic protein. These modifications may be 
deKberate, as through site-durected mutagenesis, or may be accidental, such as through mutations 
of hosts that produce the antigens. Modifications of polynucleotides may have any number of 
effects including, for example, facilitating expression of the polypeptide product in a host cell. 
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Polynucleotides of the invention may be prepared in many ways e.g. by chemical synthesis 
(e.g. phosphoramidite synthesis of DNA) in whole or in part, by digesting longer nucleic adds 
using nucleases (e.g. restriction enzymes), by joining shorternucleic acids or nucleotides (e.g. 
using ligases or polymerases), from genomic or cDNA libraries, etc. 

A polynucleotide can encode a biologically active (e.g., immunogenic or therapeutic) 
protein or polypeptide. Depending on the nature of the polypeptide encoded by the 
polynucleotide, a polynucleotide can include as little as 10 nucleotides, e.g., where the 
polynucleotide encodes an antigen. 

By "isolated" is meant, when referring to a polynucleotide or a polypeptide, that the 
indicated molecule is separate and discrete from the whole organism with which the molecule is 
found in nature- or, when the polynucleotide or polypeptide is not found in nature, is sufficiently 
free of other biological macromolecules so that the polynucleotide or polypeptide can be used for 
its intended purpose. The polynucleotides and polypeptides of the invention are preferably 
isolated polynucleotides and isolated polypeptides. 

"Antibody" as known in the art includes one or more biological moieties that, through 
chemical or physical means, can bind to or associate with an epitope of a polypeptide of interest. 
The antibodies of the invention include antibodies which specifically bind to a SARS viral 
antigen. The term "antibody" includes antibodies obtained from both polyclonal and monoclonal 
preparations, as well as tiie following: hybrid (chimeric) antibody molecules (see, for example. 
Winter et al. (1991) Nature 349: 293-299; and US Patent No. 4,8 16,567; F(ab')2 and F(ab) 
fragments; Py molecules (non-covalent heterodimers, see, for example, Inbar et al (1972) Proc 
Natl Acad Sci USA 69:2659-2662; and Ehrlich et al. (1980) Biochem 19:409 1-4096); single- 
chain Fv molecules (sFv) (see, for example, Huston et al. (1988) Proc Natl Acad Sci USA 
85:5897-5883); dimeric and trimeric antibody fragment constructs; minibodies (see, e.g.^ Pack 
et al (1992) Biochem 31:1579-1584; Cumber et al (1992) J Immunology 149B: 120-126); 
humanized antibody molecules (see, for example, Riechmann et al (1988) Nature 332:323-327; 
Verhoeyan et al. (1988) Science 239:1534-1536; and U.K. Patent Publication No. GB 2,276,169, 
published 21 September 1994); and, any functional fragments obtained from such molecules, 
wherein such fragments retain immunological binding properties of the parent antibody 
molecule. The term "antibody" further includes antibodies obtained through non-conventional 
processes, such as phage display. 

As used herein, the term "monoclonal antibody" refers to an antibody composition having 
a homogeneous antibody population. The term is not limited regarding the species or source of 
the antibody, nor is it intended to be limited by the manner in which it is made. Thus, the term 
encompasses antibodies obtained from murine hybridomas, as well as human monoclonal 
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aitibodies obtained using human rather than murine hybridomas. See, e.g.. Cote, et al. 
Monoclonal Antibodies and Cancer Therapy, Alan R. liss, 1985, p 77. 

An "immunogenic composition" as used herein refers to a composition that comprises an 
antigenic molecule where administration of the composition to a subject results in the 
development in the subject of a humoral and/or a cellular immune response to the antigenic 
molecule of interest. The immunogenic composition can be introduced directly into a recipient 
subject, such as by injection, inhalation, oral, intranasal or any other parenteral, mucosal or 
transdermal {e.g.. intra-rectally or intra-vaginally) route of administration. 

The term "derived from" is used to identify the source of molecule {e.g.. a molecule can be 
derived from a polynucleotide, polypeptide, an immortalized cell line can be derived from any 
tissue, etc.). A first polynucleotide is "derived from" a second polynucleotide if it has the same 
or substantially the same basepair sequence as a region of the second polynucleotide, its cDNA, 
complements thereof, or if it displays sequence identity as described above. Thus, a first 
polynucleotide sequence is "derived from" a second sequence if it has (i) the same or 
substantially the same sequence as the second sequence or (ii) displays sequence identity to 
polypeptides of that sequence. 

A first polypeptide is "derived from" a second polypeptide if it is (i) encoded by a firet 
polynucleotide derived from a second polynucleotide, or (ii) displays sequence identity to the 
second polypeptides as described above. Thus, a polypeptide (protein) is "derived from" a 

particular SARS virus if it is (i) encoded by an open reading frame of a polynucleotide of that 
SARS virus, or (ii) displays sequence identity, as described above, to polypeptides of that SARS 



virus 



Both polynucleotide and polypeptide molecules can be physically derived from a SARS 
virus or produced recombinantly or synthetically, for example, based on known sequences. 

A cultured cell or cell line is "derived from" another cell, cells or tissue if it is originaUy 
obtained from existing cells or tissue. Non-limiting examples of tissue that ceUs may be derived 
from include skin, retina, Uver, kidney, heart, brain, muscle, intestinal, ovary, breast, prostate, 
cancerous tissue, tissue infected with one or more pathogens (e.g., viruses, bacteria etc.) and the 
like. The cells described herein may also be derived from other ceUs including, but not limited 
to, primary cultures, existing immortalized cells line and/or other isolated cells. 

An "antigen" refers to a molecule containing one or more epitopes (eitiier linear, 
conformational or both) that will stimulate a host's immune system to make a humoral and/or 
cellular antigen-specific response. The term is used interchangeably with the term 
"immunogen." Normally, an epitope will include between about 3-15, generally about 5-15 
amino acids, A B-cell epitope is normally about 5 amino acids but can be as small as 3-4 amino 
acids. A T-cell epitope, such as a CTL epitope, will include at least about 7-9 amino acids, and a 
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helper T-cell epitope at least about 12-20 amino acids. Nonnally, an epitope will include 
between about 7 and 15 amino acids, such as, 9, 10, 12 or 15 amino acids. The tenn "antigen" 
denotes both subunit antigens, antigens which are separate and discrete from a whole 
organism with which the antigen is associated in nature), as well as, killed, attenuated or 
inactivated bacteria, viruses, fungi, parasites or other microbes as well as tumor antigens, 
including extracellular domains of cell surface receptors and intracellular portions that may 
contain T-cell epitopes. Antibodies such as anti-idiotype antibodies, or ftagments thereof, and 
synthetic peptide mimotopes, which can mimic an antigen or antigenic determinant, are ^so 
captured under the definition of antigen as used herein. Similarly, an oligonucleotide or 
polynucleotide that expresses an antigen or antigenic determinant in vivo, such as in gene therapy 
and DNA immunization appUcations, is also included in the definition of antigen herein. 

An "immunological response" to an antigen or composition is the development in a 
subject of a humoral and/or a cellular immune response to an antigen present in the composition 
of interest. For purposes of the present invention, a "humoral immune response" refers to an 
immune response mediated by antibody molecules, including secretory agA) or IgG molecules, 
while a "cellular immune response" is one mediated by T-lymphocytes and/or other white blood 
ceUs. One important aspect of cellular immunity involves an antigen-specific response by 
cytolytic T-cells ("CTL"s). CTLs have specificity for peptide antigens that are presented in 
association with proteins encoded by the major histocompatibility complex (MHC) and 
expressed on the surfaces of cells. CTLs help induce and promote the destruction of intracellular 
microbes, or the lysis of cells infected with such microbes. Another aspect of cellular immunity . 
involves an antigen-specific response by helper T-cells. Helper T-ceUs act to help stimulate the 
function, and focus the activity of, nonspecific effector cells against cells displaying peptide 
antigens in association with MHC molecules on their surface. A "cellular immune response" 
also refers to the production of cytokines, chemokines and other such molecules produced by 
activated T-cells and/or other white blood ceUs, including those derived from CD4+ and CD8+ 
T-cells. In addition, a chemokine response may be induced by various white blood or 
endothelial cells in response to an administered antigen. 

//. VACCINE FORMULATIONS 

The invention relates to vaccine formulations for the treatment or prevention of Severe 
Acute Respiratory Syndrome (SARS). Vaccine formulations of the invention include an 
inactivated (or killed) SARS virus, an attenuated SARS virus, a split SARS vkus preparation and 
a recombinant or purified subunit formulation of one or more SARS viral antigens. The 
invention includes polypeptides and polynucleotides encoding for SARS viral antigens and 
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immunogenic fragments hereof. Expression and deHvery of the polynucleotides of the invention 
may be facilitated via viral vectors and/or viral particles, including Virus Like Particles (VLPs). 
A. Inactivated (nr Killed) SARS Vaccines 

The invention includes a composition comprising an inactivated (or killed) SARS virus and 
methods for the production thereof. Inactivated SARS viral compositions can be used as 
prophylactic or therapeutic SARS virus vaccine. Preferably the inactivated SARS virus vaccine 
composition comprises an amount of inactivated SARS virus which, before inactivation.is 
equivalent to a virus titer of from about 4 to 7 logs plaque forming units (PFU) or 4 to 7 logs 
tissue culture infectious dose 50 (TCID50) per milliliter. More preferably, before inactivation the 
virus titer is from 4 to 11, 7 to 11 or 9 to 11 PFU or TCID50. Still more preferably the 
inactivated SARS virus vaccine composition comprises an amount of inactivated SARS virus 
which, before inactivation, is equivalent to a virus titer of from about 5 to 9 PFU or 5 to 9 
TCID50 per milliliter. In one embodiment, the PFU or TCID50 of the cultured SARS virus at 
harvest is 6 to 8, more preferably about 7.5 PFU or TCID50 per milliUter. Upon concentration of 
the viral harvest, the PFU or TCID50 is preferably 8 to 11. still more preferably about 9 PFU or 
TCIDso per miUiUter. The vaccine composition comprises a sufficient amount of the SARS virus 
antigen to produce an immunological response in a primate. 

Methods of inactivating or killing viruses are known in the art to destroy the ability of the 
viruses to infect mammalian cells. Such methods include both chemical or physical means. 
Chemical means for inactivating a SARS virus include treatment of the vims with an effective 
amount of one or more of the following agents: detergents, formaldehyde, formalin, ^- 
propiolactone, or UV Ught. Additional chemical means for inactivation include treatment with 
methylene blue, psoralen, carboxyfullerene(C60) or a combination of any thereof. Other 
methods of viral inactivation are known in the art, such as for example binary ethylamine, acetyl 
ethyleneimine, or gamma irradiation. 

For example formaldehyde may be used at concentrations such as 0.1 to 0.02%, preferably 
at 0.02 to 0.1 %, and still more preferably at 0.04 to 0.05%. The inactivating agent is added to 
virus containing culture supematants prior to or after harvesting said culture supematants from 
vessels used for virus propagation, either with or without a step of cell disruption for release of 
cell-associated virus prior to harvesting. Further, the inactivating agent may be added after said 
culture supematants have been stored frozen and thawed, or after one or more steps of 
purification to remove cell contaminants. Preferably, however, formaldehyde is added after 
removal of cells and cellular debris or after one or more purification steps. After addition of 
formaldehyde, the virus containing mixture is transferred into an incubation vessel and incubated 
at refrigeration temperatures (e.g. +2 to S^C) or alternatively at elevated temperatures, such as 
ambient temperatures between approximately 20 and 30°C or at SS^C to 37''C for a period of 12 
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hours to 7 days, whereby the temperature chosen should be adjusted to the duration of 
incubation. Plefered conditions are e.g. +2 -S°C for 3-7 days (prefered are 3 -7days), ambient 
temperatures and incubation for 16 hours to 3 days (prefered 24- 48 hours), or 35-37°C for 12-36 
hours. If it is desirable to remove excess formalin, sodium thiosulfate or sodium metabisulfite at 
equimolar or 1.5 -fold molar concentration (relative to formaldehyde) may be added after 
completing the inactivation process. 

For example, p-propiolactone may be used at concentrations such as 0.01 to 0.5%, 
preferably at 0.5% to 0.2%, and still more preferably at 0.025 to 0.1%. The inactivating agent is 
added to virus containing culture supematants (virus material) prior to or after harvesting said 
culture supematants from vessels used for virus propagation, either with or without a step of cell 
disruption for release of cell-associated virus prior to harvesting. Furtiher, the inactivating agent 
may be added after said culture supematants have been stored frozen and thawed, or after one or 
more steps of purification to remove cell contaminants. P-propiolactone is added to the virus 
material, with the adverse shift in pH to acidity being controlled with sodium hydroxide (e.g., 1 
N NaOH), a Tris-buffer or sodium bicarbonate solution. After transfering the mixture to another . 
inactivation vessel, the combined inactivating agent-virus materials are incubated at temperatures 
from 4°C to 37°C, for incubation times of preferably 24 to 72 hours. 

Another inactivant which may be used is binary ethyleneimine (BEI). Equal volumes of a 
0.2 molar bromoethylamine hydrobromide solution and a 0.4 molar sodium hydroxide solution 
are mixed and incubated at about 37*'C. for 60 minutes. The resulting cyclized inactivant is 
binary ethyleneimine, which is added to the virus materials at 0.5 to 4 percent, and preferably at 
1 to 3 percent, volume to volume. The inactivating virus materials are held from about 4°C to 
37°C for 24 to 72 hours with periodic agitation. At the end of this incubation 20 ml. of a sterile 1 ' 
molar sodium thiosulfate solution was added to insure neutralization of the BEI. 

In one embodiment, the invention includes an inactivating method is designed to maximize 
exposure of the virus to the inactivating agent and to minimize long-term exposure of the 
temperamre sensitive SARS virus particles to elevated temperatures. The invention includes an 
inactivation method comprising exposing the virus to the inactivation agent (such as BPL) for 12 
to 24 hours at refrigeration temperatures followed by hydrolysis of any residual inactivating 
agent by elevating the temperature for only 3 hours. Preferably, the refrigeration temperatures 
are between 0 and S^C, more preferably around 4°C. Preferably, the elevated temperature is 
between 33 and 4l°C, more preferably around 37°C. As assessed by a test for residual infectious 
virus using 10 ml aliquots of the inactivated preparation, the method is able to inactivate SARS- 
CoV in raw cell culture harvests below a theoretical limit of 0.03 infectious units/ml. 

Diluted and undiluted samples of the inactivated virus materials are added to susceptible 
cell (tissue) culture (e.g., VERO) to detect any non-inactivated virus. The cultured cells are 
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passaged multiple times and examined for the presence of SARS vims based on any of a variety 
of methods, such as. for example, cytopathic effect (CPE) and antigen detection (e.g., via 
fluorescent antibody conjugates specific for SARS virus). Such tests allow determin^ion of 
complete virus inactivation. 

Prior to inactivation, the SARS virus will be cultured in a mammaUan ceU culture. The ceU 
culture may be adherently growing cells or ceUs growing in suspension. Preferably the ceUs are 
of mammalian origin, but may also be derived from avian (eg., hens' ceUs such as hens' embryo 
ceUs (CEF cells)), amphibian, reptUe, insect, or fish sources. MammaUan sources of ceUs 
include, but are not limited to, human or non-human primate (e.g.. MRC-5 (ATCC CCL-171), 
WI-38 (ATCC CCL-75), HeLa ceUs, human diploid cells, fetal rhesus lung cells (e.g. ATCC CL- 
160). human embryonic kidney cells (293 cells, typically transformed by sheared adenovirus 
type 5 DNA). VERO cells (e.g., from monkey kidneys), horse, cow (e.g.. MDBK cells), sheep, 

dog (e.g., MDCK cells from dog kidneys, ATCC CCL34 MDCK (NBL2) or MDCK 33016. 
deposit number DSM ACC 2219 as described in WO 97/37001), cat, and rodent (e g hamster 
cells such as BHK21-F, HKCC cells, or Chinese hamster ovary cells (CHO cells)), and may be 
obtained from a wide variety of developmental stages, including for example, adult, neonatal, 
fetal, and embryo. 

In certain embodiments the cells are immortahzed (e.g., PERC.6 cells are described for 
example, in WO 01/38362 and WO 02/40665, incorporated by reference herein in their ' 
entireties, as well as deposited under ECACC deposit number 96022940), or any other cell type 
immortalized using the techniques described herein. 

In preferred embodiments, mammalian cells are utilized, and may be selected from and/or 
derived from one or more of the following non-hmiting cell types: fibroblast cells (e.g., dermal, 
lung), endothelial cells (e.g., aortic, coronary, pulmonary, vascular, dermal microvascular, ' - 
umbilical), hepatocytes, keratinocytes, immune cells (e.g.. T cell, B cell, macrophage. NK. 
dendritic), mammary cells (e.g., epithelial), smooth muscle cells (e,g., vascular, aortic, coronary, 
arterial, uterine, bronchial, cervical, retinal pericytes), melanocytes, neural cells (e.g., astrocytes) 
prostate cells (e.g., epitheUal, smooth muscle), renal ceHs (e.g., epithelial, mesangial, proximal 
tubule), skeletal cells (e.g., chondrocyte, osteoclast, osteoblast), muscle cells (e.g., myoblast, 
skeletal, smooth, bronchial), liver cells, retinoblasts, and sti-omal cells. WO 97/37000 and WO 
97/37001, incorporated by reference herein in their entireties, describe production of animal cells 
and ceU hnes that capable of growth in suspension and in serum free media and are useful in tiie 
production and replication of viruses. 

Preferably, the SARS viruses of the invention are grown on VERO cells or fetal rhesus 
kidney cells. 
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Culture conditions for the above cell types are well-described in a variety of publications, 
or alternatively culture medium, supplements, and conditions may be purchased commercially, 
such as for example, as described in the catalog and additional literature of Cambrex Bioproducts 
(East Rutherford, NJ). 

5 In certain embodiments, the host cells used in the methods described herein are cultured in 

serum free and/or protein free media. A medium is referred to as a serum-free medium in the 
context of the present invention in which there are no additives from seruin of human or animal 
origin. Protein-free is understood to mean cultures in which multiplication of the cells occurs 
with exclusion of proteins, growth factors, other protein additives and noii-serum proteins. Hie 

10 cells growing in such cultures naturally contain proteins themselves. 

Known serum-free media include Iscove's medium, Ultra-CaiO medium (BioWhittaker) or 
EX-CELL (JRH Bioscience). Ordinary serum-containing media include Eagle's Basal Medium 
(BME) or Minimum Essential Medium (MEM) (Eagle, Science, 130, 432 (1959)) or Dulbecco's 
Modified Eagle Medium (DMEM or EDM), which are ordinarily used with up to 10% fetal calf 

15 serum or similar additives. Optionally, Minimum Essential Medium (MEM) (Eagle, Science, 

130, 432 (1959)) or Dulbecco's Modified Eagle Medium (DMEM or EDM) may be used without 
any serum containing supplement. Protein-free media like PF-CHO (JHR Bioscience), 
chemically-defined media like ProCHO 4CDM (BioWhittaker) or SMIF 7 (Gibco/BRL Life 
Technologies) and mitogenic peptides like Primactone, Pepticase or HyPep™ (all froin Quest 

iO International) or lactalbumin hydrolyzate (Gibco and other manufacturers) are also adequately 
known in the prior art. The media additives based on plant hydrolyzates have the special 
advantage that contamination with viruses, mycoplasma or unknown infectious agents can be 
ruled out. 

. The cell culture conditions to be used for the desired application (temperature, cell density, 
15 pH value, etc.) are variable over a very wide range owing to the suitability of the cell line 

employed according to the invention and can be adapted to the requirements of the S ARS virus. 

The method for propagating the SARS virus in cultured cells {e.g., mammalian cells) 
includes the steps of inoculating the cultured cells with SARS virus, cultivating the infected cells 
for a desired time period for virus propagation, such as for example as determined by SARS 
10 virus titer or SARS virus antigen expression {e.g., between 24 and 168 hours after inoculation) 
and collecting the propagated virus. The culmred cells are inoculated with a SARS virus 
(measured by PFU or TCID50) to cell ratio of 1:10000 to 1:10. A lower range of ratios may also 
be used e.g. 1:500 to 1:1, preferably 1:100 to 1:5, more preferably 1:50 to; 1:10. The SARS virus 
is added to a suspension of the cells or is applied to a monolayer of the cells, and the virus is 
i5 absorbed on the cells for at least 60 minutes but usually less than 300 minutes, preferably 
between 90 and 240 minutes at 25°C to 40**C, more preferably 28°C to 37°C, still more 

-108- 



BNStXaCID: <WO___2004092360A2_I_> 



wo 2004/092360 PCrAJS2004Alll710 

preferably at about 33 °C. The itifected ceJl culture (e.g.. monolayers) may be treated either by 
fteeze-thawmg or by enzymatic action to mcrease the viral content of the harvested cultute 
snpematants. The harvested fluids are then either inactivated or stored frozen. 

A comparison of SARS infected Vero ceUs gmwn with and v^ithout fetal calf serum 
("FCS") is shown in FIGURE 26A. Briefly. Vero cells were split the day before infection and 

cultivated in T175 flasks, hrfection of 90% confluent Vero cell monolayers the following day 
was performed with a SARS-CoV seed stock (strain fra. passage 4. Accession number 
AY310120), with or Without 3% FCS (Fig. 26A). The addition of FCS to the cell media'showed 
Kttle impact on virus yield. 

Cultured ceUs may be infected at a multiplicity of infection ("m.o.i.") of about 0.0001 to 
10. preferably 0.002 to 5. more preferably to 0.001 to 2. Still more preferably, the cells are 
^XlT.r' ^ "^'^^'^ yield at varying m.o.i. levels is .howniu 

Infected cells may be harvested 30 to 60 hours post infection. Preferably, the cells are 
harvested 34-18 hours post infection. Still more preferably, the cells are harvested 38 to 40 
hours post infection. See FIGURE 26C. 

Methods of puriflcaUon of inactivated virus are known in the art and may include one or 
more of. for instance gradient centrifugation, ultracentrifugation, continuous-flow 
ultracentrifugation and chromatography, such as ion exchange chromatography, size exclusion 
chron^ography.andUquidafBmty chromatography. Additional method 
ultrafiltratton and dialfiltmtion. See JP Gregersen "Herstellung von Vitussimpfstoffen aus 
Zellfcultnren" Chapter 4.2 in Pharmazeutische Biotecnologie (eds. O. Kayser and RH Mueller) 
WlssenschaftBche VerlagsgeseUschaft. Stuttgart. 2000. S^eafao.O-Neil^aZ "Virus 
Harvesting and Affinity Based Liquid Chromatography. A Method for Virus Concentration and . 
Pmtfcation-. Biotechnology (1993) 11:173-177; Prior c al.. "Process Development for 

Manufacture of Inactivated mV-l-.Phannaceuttcal Technology (1995) 30-52; and Majhdi«<rf 
Isolation and Characterization of a Coronavirus from Elk Calves with diarrhea" Journal of 
Cluneal Microbiology (1995) 35(11): 2937-2942. 

Other examples of purification methods suitable for use in the invention mclude 
polyethylene glycol or ammonium sulface precipitation ^see Ttepanier e, al. "Concentration of 
hum» respiratory syncytial virus using ammomum sulfate, polyethylene glycol or hollow fiber 
ult^tration" Journal of Virological MeUiods (1981)3(4):201-2n; Hagen e.al.. "Optimization 
of Poly(ethylene glycol) Precipitation of Hepatitis Virus Used to prepare VAQTA a Highly 
Punfied Inactivated Vaccine" Biotechnology Progress (1996) 12:406-412; and Caisson e, al 
•Purification of Infectious Pancreatic Necrosis Virus by Amon Exchange Chromatography " 
Increases tiie Specific Infectivity" Journal of Virological MeUiods (1994) 47:27-36) as weU as 
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ultrafiltration and microfiltration (see Pay et ah. Developments in Biological Standardization 
(1985) 60:171-174; Tsummi et ah, "Structure and filtration performances of improved 
cuprammonium regenerated cellulose hollow fibre (improved BMM hollow fibre) for virus 
removal" Polymer Journal (1990) 22(12): 1085-1100; and Makino et aL, "Concentration of live 
retrovirus with a regenerated cellulose hoUow fibre, BMM*', Archives of Virology (1994) 139(1- 
2): 87-96.). 

Preferably, the virus is purified using chromatography, such as ion exchange 
chromatography. Chromatic purification allows for the production of large volumes of virus 
containing suspension. The viral product of interest can interact with the chromatic medium by a 
simple adsorption/desorption mechanism, and large volumes of sample can be processed in a 
single load. Contaminants which do not have affinity for the adsorbent pass through the column. 
The virus material can then be eluted in concentrated form. 

Preferred anion exchange resins for use in the invention include DEAE, EMD TMAE. 
Preferred cation exchange resins may comprise a sulfonic acid-modified surface. In one 
embodiment, the virus is purified using ion exchange chromatography comprising a strong anion 
exchange resin (e.g. EMD TMAE) for the first step and EMD-SO3 (cation exchange xesin) for 
the second step. A metal-binding affinity chromatography step can optionally be included for 
further purification. (See, e.^., WO 97/06243). 

A preferred resin for use in the invention is Fractogel™ EMD. This synthetic methacrylate 
based resin has long, linear polymer chains (so-called "tentacles") covalently attached. This 
"tentacle chemistry" allows for a large amount of sterically accessible ligands for the binding of 
biomolecules without any steric hindrance. This resin also has improved pressure stability. 

Column-based liquid affinity chromatography is another preferred purification method for " 
use in the invention. One exanaple of a resin for use in this purification method is Matrex""^ 
Cellufine™ Sulfate (MCS). MCS consists of a rigid spherical (approx. 45-105 fxm diameter) 
cellulose matrix of 3,000 Dalton exclusion limit (its pore strucmre excludes macromolecules), 
with a low concentration of sulfate ester functionality on the 6-position of cellulose. As the 
functional ligand (sulfate ester) is relatively highly dispersed, it presents insufficient cationic 
charge density to allow for most soluble proteins to adsorb onto the bead surface. Therefore the 
bulk of the protein found in typical virus pools (cell culture supematants, e.g. pyrogens and most 
contaminating proteins, as well as nucleic acids and endotoxins) are washed from the column 
and a degree of purification of the bound virus is achieved. 

The rigid, high-strength beads of MCS tend to resist compression. The pressure/flow 
characteristics the MCS resin permit high linear flow rates allowing high-speed processing, even 
in large columns, making it an easily scalable unit operation. In addition a chromatographic 
purification step with MCS provides increased assurance of safety and product sterility, avoiding 
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excessive product handling and safety concerns. As endotoxins do not bind to it, the MCS 
purification step aUows a rapid and contaminant free depyrogenation. Gentle binding and 
elution conditions provide high capacity and product yield. The MCS resin therefore represents 
a simple, rapid, effective, and cost-saving means for concentration, purification and 
depyrogenation. In addition, MCS resins can be reused repeatedly. 

The inactivated virus may be further purified by gradient centrifiigation, preferably density 
gradient centrifiigation. For commercial scale operation a continuous flow sucrose gradient 

centrifiigation would be the preferred option. This method is widely used to purify antixdral 
vaccines and is known to the expert in the field {^See JP Giegersen "Herstellung von 
Virussimpfstoffen aus Zellkulturen" Chapter 4.2 in Phannazeutische Biotecnologie (eds. O. 
Kayser and RH Mueller) Wissenschaftliche Verlagsgesellschaft, Smttgart, 2000.) 

The density gradient centrifiigation step may be performed using laboratory or commercial 
scale gradient centrifiigation equipment. For example, a swinging bucket rotor, a fixed angle 
rotor, or a vertical tube rotor, particularly for laboratory scale production of the virus. 

Preferably, the gradient centrifugation step is performed using a swinging bucket rotor. This 
type of rotor has a sufficiently long pathlength to provide high quality separations, particularly 
with multicomponent samples. In addition, swingmg bucket rotors have greatly reduced wall 
effects, and the contents do not reorient during acceleration and deceleration. Because of their 
longer pathlength. separations take longer compared to fixed angle or vertical tube rotors. The 
prepared sucrose solutions are controlled via refiractometer on their sucrose concentration. 

Sucrose gradients for density gradient centrifugation, such as in a swinging bucket 
centrifiige tubes may be formed prior to centrifiigation by the use of a gradient former 
(continuous/linear). The volume of sample which can be applied to the gradient in a swinging 
bucket rotor tube is a fiinction of the cross-sectional area of the gradient that is exposed to the 
sample. If the sample volume is too high, there is not sufficient radial distance in the centrifiige 
tube for effective separation of components in a multicomponent sample. 

An approximate sample volume for swinging bucket rotor SW 28 is 1-5 ml per tube (with a • 
tube diameter of 2.54 cm). The sample is applied to the gradient by pipetting the volume on top 
of the gradient. The blunt end of the pipette is placed at 45-60*' angle to the tube wall, 
approximately 2-3 mm above the gradient. The sample is injected slowly and allowed to run 
down the wall of the tube onto the gradient. After centrifugation gradient fractions are recovered 
by carefully inserting a gauge needle until the bottom of the tube and starting to collect fractions 
of 2 ml by pumping the liquid from the tube into falcon tubes. 

Sucrose density gradients suitable for use with this density gradient centrifiigation 
purification step include 0 - 60%, 5 - 60%. 15 - 60%, 0 - 50%, 5 - 50%. 15 - 50%, 0 - 40%. 5 
- 40%, and 15 - 40%. Preferably, the sucrose density gradient is 15 - 40%. 5 - 40% or 0 - 40%. 
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Alternatively, a discontinuous sucrose density gradient may be used for purification. A 
discontinuous sucrose density scheme provides for discrete, overlaying layers of differing 
sucrose concentrations. In one example, a first layer of 50% sucrose is covered by a second 
layer of 40% sucrose; the second layer is covered by a third layer of 20% sucrose; the third layer 
is covered by a fourth layer of 10% sucrose; and the fourth layer is covered by the solution 
containing the virus to be purified. 

In one embodiment, inactivated virus is purified by a method comprising a first step of 
chromatography purification and a second step of gradient centrifugation.; Preferably the first 
step comprises liquid affinity chromatography, such as MCS. Preferably, the second step 
comprises density gradient centrifugation using a swinging bucket rotor. 

Additional purification methods which may be used to purify inactivated S ARS virus 
include the use of a nucleic acid degrading agent, preferably a nucleic acid degrading enzyme, 
such as a nuclease having DNase and RNase activity, or an endonuclease, such as from Serratia 
marcescens, commercially available as Benzonase™, membrane adsorbers with anionic 
functional groups {e.g. Sartobind™) or additional chromatographic steps with anionic functional 
groups ie.g. DEAE or TMAE). An ultrafiltration/dialfiltration and final sterile filtration step 
could also be added to the purification method. 

Preferably, the purification includes treatment of the SARS viral isolate with one or more 
nucleic acid degrading enzymes. These enzymes may be used to reduce the level of host cell 
nucleic acid in the viral purification process. Nucleic acid digesting enzymes for use in cell 
culture are known in the art and include, for example, Benzonase™. 

The treatment of the virus with the nucleic acid degrading enzyme and inactivating agent 
can be performed by a sequential treatment or in a combined or simultaneous manner. 
Preferably, the nucleic acid degrading agent is added to the virus preparation prior to the addition 
of the inactivating agent. 

The purified viral preparation of the invention is substantially fi-ee of contaminating 
proteins derived fix)m the cells or cell culture and preferably comprises less than about 1000, 
500, 250, 150, 100, or 50 pg cellular nucleic acid / /tg virus antigen, preferably less than about 
1000, 500, 250, 150, 100, or 50 pg cellular nucleic acid/ dose. Still more preferably, the purified 
viral preparation comprises less than about 20 pg, and even more preferably, less than about 10 
pg. Methods of measuring host cell nucleic acid levels in a viral sample are known in the art. 
Standardized methods approved or recommended by regulatory authorities such as the WHO or 
the FDA are preferred. 

The invention includes an inactivated vaccine composition comprising a prophylactically 
effective amount of SARS viral antigen, preferably spike or an immunogenic fragment thereof. 
The SARS viral antigen is preferably present in a concentration amount of 0.1 to 50 ;tg 
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antigen/dose, more preferably 0.3 to 30 ng antigen/dose. StDl more preferably, the antigen is 
about 15 jug/dose. 

In one embodiment, a lower concentration of S ARS viral antigen is used in inactivated 
vaccine compositions of the invention. Such lower concentration vaccines may optionally 
comprise an adjuvant to boost the host immune response to the antigen. In such a "low dose" 
vaccine, the SARS viral antigen is preferably present in a concentration of less than 15 jUg 
antigen/dose, less than 10, 7.5, 5 or 3 /ig antigen/dose. 

The inactivated vaccine preparations of the invention may further comprise a stabilizer to 
preserve the integrity of the inmiunogenic proteins in the inactivated viral preparation. 
Stabilizers suitable for use in vaccines are known in the art and may include, for example, 
buffers, sugars, sugar alcohols, and amino acids. Stabilizing buffers are preferably adjusted to a 
physiological pH range and may include phosphate buffers, Tris buffers, TE (Tris/EDTA), TEN 
(Tris/NaCl/EDTA) and Earle's salt solution. StabiUzing sugars may include, for example; one or 
more of saccharose, glucose, fructose, dextranes, dextranesulphate, and trehalose. Stabilizing 
sugar alcohols may include, for example, Xylite/Xylitole, ManniteAlannitol, Sorbite/Sorbitol, 
and Glycerol. Amino acids suitable for use in the invention include, for example, L-glutamine, ' 
arginine, cysteine, and lysine. Additional stabilizers which may be used in the invention includL 
Tartaric acid, Pluronic F 68, and Tween 80. 

SARS viral isolates which may be used for the inactivated viral preparations of the 
invention may be obtained and identified by any of the mechanisms described supra. For 
example, a SARS isolate may be obtained from a clinical sample and plaque purified. Such 
methods of viral isolation are known in the art. 

Further purification procedures can be applied to ensure the seed virus used for preparation 
of the vaccine does not contain, for example, unwanted adventitious agents. In one embodiment, 
viral RNA from the viral isolate can be isolated from the virus, purified (and. optionally, the 
sequence verified through PGR or other means) and then introduced into a suitable cell culture. 

As an example of this technique, a clinical viral sample is plaque purified and amplified on 
vero cells to generate a sufficient amount of the viral sample for analysis. Cellular remnants are 
then cleared from the supernatant by centrifugation. The virus can then be pelleted by 
ultracentrifugation and the pellet resuspended in PBS. After further centrifugation purification, 
the virus containing fraction is treated with a DNase (and optionally also an RNase). Viral RNA 
is then isolated from this fraction and tiransfected into a host cell. 

Bcamples 2 and 3 provide an illustration of purification of inactivated whole SARS virus 
using MCS chromatography resin purification followed by density gradient ultracentiifugation. 
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Routes and methods of immunization of the vaccines of the invention are discussed in 
more detail in a section below. Examples 4 and 5 provide illustrations of a mouse immunization 
scheme with the inactivated SARS virus of the invention. 
B. Attenuated SARS Vaccines 
5 The invention includes a composition comprising an attenuated SARS virus. This 

composition can be used as a prophylactic or therapeutic SARS virus vaccine. Methods of 
attenuating viruses are known in the art. Such methods include serial passage of the SARS virus 
in cultured cells ie.g. , mammalian cell culture, preferably fetal rhesus kidney cells or VERO 
cells-see the discussion in Section A above regarding culture of SARS virus), until the SARS 

10 virus denionstrates attenuated function. The temperature at which the virus is grown can be any 
temperature at which with tissue culture passage attenuation occurs. Attenuated function of the 
SARS virus after one or more passages in cell culture can be measured by one skilled in the art. 
As used herein, attenuation refers to the decreased virulence of the SARS virus in a human 
subject. Evidence of attenuated function may be indicated by decreased levels of viral 

15 replication or by decreased virulence in an animal model. 

Other methods of producing an attenuated SARS virus include passage of the virus in cell 
culture at sub-optimal or "cold" temperatures and introduction of attenuating mutations into the 
SARS viral genome by random mutagenesis (e.g., chemical mutagenesis) or site specific directed 
mutagenesis. Preparation and generation of attenuated RSV vaccines (the methods of which will 

20 generally appUcable to SARS virus) are disclosed in, for example, EP 0 640 128, US Patent No. 
6,284,254, US Patent No. 5,922,326, US Patent No. 5,882,651. 

The attenuated derivatives of SARS virus are produced in several ways, such as for ■ 
example, by introduction of temperature sensitive-mutations either with or witiiout chemical 
mutagenesis (e.g., 5-fluorouracil), by passage in culture at "cold" temperatures. Such cold 

15 adaptation includes passage at temperatures between about 20''C to about 32°C, and preferably 
between temperatures of about 22°C to about 30**C, and most preferably between temperatures 
of about 24"'C and 28*0. The cold adaptation or attenuation may be performed by passage at 
increasingly reduced temperatures to introduce additional growth restriction mutations. The 
number of passages required to obtain safe, immunizing attenuated virus is dependent at least in 

iO part on the conditions employed. Periodic testing of the SARS virus culture for virulence and 
immunizing ability in animals (e.g., mouse, primate) can readily determine the parameters for a 
particular combination of tissue culture and temperature. The attenuated vaccine will typically be 
formulated in a dose of from about 10^ to 10^ PFU or TCID50, or more for maximal efficacy. 
Attenuated virus vaccines for S ARS-CoV also are produced by creating virus chimeras 

>5 comprising sequences derived from at least two different coronaviruses, one of which is a SARS- 
CoV. For example, a virus chimera is produced that comprises nonstmcmral protein encoding 
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genes derived from a first coronavirus (e.g., murine, bovine, porcine, canine, feline, avian 
coionavirus) and one or more structural protein encoding genes (e.g., spike, E, M) from a SARS- 
CoV. Alternatively, the virus chimera may comprise sequences derived from a human 
coronavims that is not a SARS-CoV (e.g., OC43, 229E) together with sequences from a SARS- 
CoV. Chimeric coronaviruses of the present invention are generated by a variety of methods, 
including for example allowing for natural RNA recombination in a eukaryotic (e.g. , 
mammalian) cell that contains RNA from each of the parental coronaviruses (e.g., foUowing 
infection) or by using standard molecular biology techniques known to those of skill in the art to 
engineer desired virus chimeras (or portions thereof) as cDNA clones, which may then be used to 
produce infectious virus (see for example. US 65931 1 1 B2; Yount et al., 2003, Proc. Natl Acad 
Sci. USA 100(22):12995-13000). An attenuated phenotype of the coronavirus chimeras 
described herein can be readily measured by one of skill in the art. 

Attenuated viruses can be also generated by deleting one or more open reading frames 
(ORFs) that are not essential for viral repHcation. Preferably, these deletions occur in the- 
structural region of the genome, such as ORF 3a, 3b, 6, 7a, 7b, 8a, 8b. 9b. See e.g.. Haijema BJ, . 
Volders H. Rottier PJ. J Virol. (2004) 78(8):3863-71; and de Haan. C. A., P. S. Masters. X. Shen, 
S. Weiss, and P. J. Rottier, "The group-specific murine coronavirus genes are not essential, but ' 

their deletion, by reverse genetics, is attenuating in the natural host." Virology (2002) 296:177- 
189. Deletion of such regions within a coronavirus such as SARS can be achieved, for example, 

by reverse genetics or "targeted recombination" (See, e.g.. Masters, P. S., "Revei^e genetics of ' 

the largest RNA viruses". Adv. Virus Res. (1999) 53:245-264. 

Methods of purification of attenuated virus are known in the art and may include one or 

more of, for instance gradient centrifugation and chromatography. See Gregersen "Herstellung 

von Virussimpfstoffen aus Zellkulturen" Chapter 4.2 in Pharmazeutische Biotecnologie (eds. O. 

Kayser and RH Mueller) WissenschaftUche Verlagsgesellschaft, Stuttgart. 2000. 

C. SvlitSARS Vaccines 

The invention includes a composition comprising a split SARS virus formulation and 

methods for the manufacture thereof. This composition can be used as a prophylactic or 

therapeutic SARS virus vaccine. 

Metiiods of splitting enveloped viruses are known in the art. Methods of splitting 

enveloped viruses are disclosed, for example, in WO 02/28422, incorporated herein by reference 

in its entirety, and specifically including the splitting agents and methods described tiierein. 

Methods of splitting influenza viruses are disclosed, for example, in WO 02/067983 WO 

02/074336, and WO 01/21151, each of which is incorporated herein by reference in hs entirety. 
The splitting of the virus is carried out by disrupting or fragmenting whole virus, infectious 

(wild-type or attenuated) or non-infectious (for example inactivated), with a disruptiiig 
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concentration of a splitting agent The disraption results in a full or partial solubilisatioh of the 
virus proteins, altering the integrity of the virus. 

Preferably, the splitting agent is a non-ionic or an ionic surfactant. Accordingly, the split 
SARS virus formulations of the invention may also comprise at least one non-ionic surfactant or 
5 detergent. Examples of splitting agents useful in the invention include: bile acids and derivatives 
thereof, non-ionic surfactants, alkylglycosides or alkylthicglycosides and derivatives thereof, 
acyl sugars, sulphobetaines, betains, polyoxyethylenealkylethers, NJST-dialkyl-Glucamides, 
Hecameg, allcylphenoxypolyethoxyethanols, quaternary ammonium compounds, saicosyl, CTAB 
(cetyl trimethyl ammonium bromide) or Cetavlon. 
10 Preferably, the ionic surfactant is a cationic detergent. Cationic detergents suitable for use 

in the invention include detergents comprising a compound of the following formula: 

wherein , 
Ri, R2 and R3 are the same or different and each signifies alkyl or aryl, or 
15 Ri and R2, together with the nitrogen atom to which these are attached form a 5- or 6- 

membered heterocyclic ring, and 
R3 signifies alkyl or aryl, or 

Ri, R2 and R3 together with the nitrogen atom to which these are attached, signify a 5- or 6- 
membered heterocyclic ring, unsaturated at the nitrogen atom, 
20 R4 signfies alkyl or aryl, and 

X signifies an anion. 

Examples of such cationic detergents are cetyltrimethylanunonium salts, such as 
ceytltrimethylammonium bromide (CTAB) and myristyltrimethylammonium salt. 

Additional cationic detergents suitable for use in the invention include lipofectine, 
15 lipofectamine, and DOT-MA. 

Non-ionic surfactants suitable for use in the invention include one or more selected from 
the group consisting of the octyl- or nonylphenoxy polyoxyethanols (for example the 
commercially available Triton series), polyoxyethylene sorbitan esters (Tween series) and 
polyoxyethylene ethers or esters of the general formula : 
JO 0(CH2CH20)n-A-R 

wherein n is 1-50, A is a bond or -C(0)-, R is Ci-50 alkyl or phenyl G1.50 alkyl; and 
combinations of two or more of these. 

The invention comprises a method of preparing a split SARS virus comprising contacting 
the SARS virus with a sufficient amount of splitting agent to disrupt the viral envelope. The loss 

-116- 



BNSDOCIO: <W O 2 0040923eOAg I > 



JO 



wo 2004/092360 PCT/US2004/011710 

of integrity after spUtting renders the virus non-infectious. Once the disrupted viral envelope 
proteins are generally no longer associated with whole intact virions, other viral proteins are 
preferably fiilly or partially solubiUzed and are therefore not associated, or only in part 
associated, with whole intact virions after splitting. 

The method of preparing a split SARS virus may further comprise removal of the splitting 
agents and some or most of the viral lipid material. The process may also include a number of 
different filtration and/or other separation steps such as ultracentrifugation, ultrafiltration, zonal 
centrifugation and chromatographic steps in a variety of combinations. The process may also 
optionally include an inactivation step (as described above) which may be carried out before or 
after the splitting. The splitting process may be carried out as a batch, continuous, or semi- 
continuous process. 

SpUt SARS virus vaccines of the invention may include structual proteins, membrane 
firagments and membrane envelope proteins. Preferably, the split SARS virus preparations of the 
invention comprise at least half of the viral structural proteins. 

One example of a method of preparing a split SARS virus fonnulation includes the 
following steps: 

(i) propagation of the SARS virus in ceU culture, such as MRC-5 cells (ATCC CCI^171), 
WI-38 cells (ATCC CCL-75), fetal rhesus kidney cells or vero cells (See the discussion in 
Section A, above, regarding culture of SARS virus); 

(ii) harvesting of SARS virus-containing material from the cell culture; 

(iii) clarification of the harvested material to remove non-SARS virus material; 

(iv) concentration of the harvested SARS virus; 

(v) separation of the whole SARS virus from non-virus material; 

(vi) splitting of the whole SARS virus using a suitable splitting agent in a density gradient 
centrifugation step; and 

(vii) filttation to remove undesired materials. 

The above steps are preferably performed sequentially. 

The clarification step is preferably performed by centrifugation at a moderate speed. 
Alternatively, a filtration step may be used for example with a 0.2/im membrane. 

The concentration step may preferably employ an adsorption method, for instance, using 
CaHPO*. Alternatively, filtration may be used, for example ultrafiltration. 

A fiirther separation step may also be used in the method of the invention. This further 
separation step is preferably a zonal centrifugation separation, and may optionally use a sucrose 
gradient. The sucrose gradient may further comprise a preservative to prevent microbial growth. 

The splitting step may also be performed in a sucrose gradient, wherein the sucrose 
gradient contains the splitting agent. 
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The method may further comprise a sterile filtration step, optionally at the end of the 
process. Preferably, there is an inactivation step prior to the final filtration step. 

Methods of preparing split S ARS virus formulations may further include treatment of the 
viral formulation with a DNA digesting enzyme. These enzymes may be used to reduce the level 
5 of host cell DNA in the viral purification process. DNA digesting enzymes for use in cell culture 
are known in the art and include, for example, Benzonase®. 

Treatment of the S ARS virus formulation with a DNA digesting enzyme may occur at any 
time in the purification and splitting process. Preferably, however, the SARS virus formulation 
is treated with a DNA digesting enzyme prior to use of a detergent. Still more preferably, the 
:0 SARS virus formulation is treated with a DNA digesting enzyme, such as Benzonas, prior to 
treatment with a cationic detergent, such as CTAB. 

Methods of purification of split vims are known in the art. See JP Gregersen **Herstellung 
von Virussimpfstoffen aus Zellkulturen" Chapter 4.2 in Pharmazeutische Biotecnologie (eds. O. 
Kayser and RH Mueller) Wissenschaftliche Yerlagsgesellschaft, Stuttgart, 2000. 
5 The invention includes a split vaccine composition comprising a prophylactically effective 

amount of SARS viral antigen, preferably spike or an immunogenic fragment thereof. The 
SARS viral antigen is preferably present in a concentration amount of 0. 1 to 50 jUg antigen/dose, 
more preferably 0.3 to 30 (ig antigen/dose. Still more preferably, the antigen is about 15 
pig/dose. 

0 In one embodiment, a lower concentration of SARS viral antigen is used in split vaccine 

compositions of the invention. Such lower concentration vaccines may optionally comprise an 
adjuvant to boost the host immune response to the antigen. In such a "low dose" vaccine; the 
SARS viral antigen is preferably present in a concentration of less than 15 fig antigen/dose, 
less than 10, 7.5, 5 or 3 /zg antigen/dose. 

5 D. Subunit SARS Vaccines 

The invention includes a composition comprising an isolated or purified SARS viral 
antigen or a derivative thereof. The composition may further comprise one or more adjuvants. 

SARS viral antigens can be isolated or purified from a SARS virus grown in cell culture. 
Alternatively, SARS viral antigens can be recombinantiy produced by methods known in the art. 

D The SARS viral antigens used in the invention can be produced in a variety of different 

expression systems which are known in the art; for example those used with mammalian cells, 
baculoviruses, bacteria, and yeast. Such expression systems will typically use polynucleotides 
encoding the viral antigens of the invention. Such sequences can be obtained using standard 
techniques of molecular biology, including translating the amino acid sequences listed herein. 

5 Accordingly, the invention includes polynucleotides encoding for tiie viral antigens of die 
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invention. In addition, the viral antigens of the invention can be produced (at least in part, 
preferably in whole) via synthetic chemistry methods. 

Insect cell expression systems, such as baculovirus systems, are known to those of skill in 
the art and described in, e.g.. Summers and Smith, Texas Agricultural Experiment Station 
Bulletin No. 1555 (1987). Materials and methods for baculovirus/insert cell expression systems 
are commercially available in kit form from, inter alia, Invitrogen, San Diego CA. Similarly, 
bacterial and mammalian cell expression systems are also known in the art and described in, e.g., 
y^oj/ Ge/zeftc ^-/i^megnng (Barr a/., eds., 1989) Butterworths, London. 

A number of appropriate host cells for use with the above systems are also known. For 
example, mammalian cell lines are known in the art and include immortaHzed cell Hnes available 
from the American Type Culture Collection (ATCC), such as. but not Umited to Chinese 
hamster ovary (CHO) cells, HeLa cells, baby hamster kidney (BHK) cells, monkey kidney cells 
ie.g.. Hep G2), Madin-Darby bovine kidney ("MDBK") cells, as well as others. Mammalian 
sources of cells include, but are not limited to, human or non-human primate {e g MRC-5 
(ATCC CCL-171), WI-38 (ATCC CCL-75), fetal rhesus lung cells (ATCC CL-160). human 
embryonic kidney cells (293 cells, typically transformed by sheared adenovirus type 5 DNA). 
VERO cells from monkey kidneys), horse, cow {e.g., MDBK cells), sheep, dog (e.g. MDCk' 
cells from dog kidneys. ATCC CCL34 MDCK (NBL2) or MDCK 33016. deposit number DSM 
ACC 2219 as described in WO 97/37001). cat. and rodent {e.g.. hamster ceUs such as BHK21-F 
HKCC cells, or Chinese hamster ovary cells (CHO cells)), and may be obtained from a wide 
variety of developmental stages, mcluding for example, adult, neonatal, fetal, and embryo. 

Similarly, bacterial hosts such as E. coli. Bacillus subtilis, and Streptococcus spp., will find- 
use with the present expression constructs. Yeast hosts useful in the present invention include. 
inter alia, Saccharomyces cerevisiae, Candida albicaiis, Candida maltosa, Hansenual 
polymorpha, Kluyveromyces fragilis, Kluyveromyces lactis, Pichia guUlerimondii, Pichia 
pastoris, Schizosaccharomycesponibe and Yarrowia lipolytica. Insect cells for use with 
baculovirus expression vectors include, inter alia, Aedes aegypti, Autographa califomica. 
Bombyx mori, Drosophila fnelanogaster, Spodopterafrugiperda, and Trichoplusia ni. 

Nucleic acid molecules comprising nucleotide sequences of the viral antigens or antibodies 
of the invention can be stably integrated into a host cell genome or maintained on a stable 
episomal element in a suitable host cell using various gene delivery techniques well known in the 
art. See., e.g., US Patent No. 5,399,346. 

Depending on the expression system and host selected, the molecules are produced by 
growing host cells transformed by an expression vector under conditions whereby the protein is 
expressed. The expressed protein is then isolated from the host ceUs and purified. Ifthe 
expression system secretes the protein into growth media, the product can be purified directly 
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from the media. If it is not secreted, it can be isolated from ceil lysates. The selection of the 
appropriate growth conditions and recovery methods are within the skill of the art. 

The invention includes a composition comprising an isolated or purified S ARS viral 
antigen or a derivative thereof. The invention also includes a composition comprising at least 
two isolated or purified SARS viral antigens or derivatives thereof, which have been co-purified 
or purified separately and then combined. In one embodiment, the SARS viral antigen is a spike 
(S) protein. In yet another embodiment, the SARS viral antigen is a nucleocapsid (N) protein, a 
membrane (M) glycoprotein, or an envelope (E) protein. Preferably, the SARS viral antigen is 
present in the composition in a purity greater than 75% (e.g., 78%, 80%, 82%, 85%, 88%, 90%, 
92%, 95%, 98%). 

The invention includes a vaccine composition comprising a prophylactically effective 
amount of SARS viral antigen, preferably spike or an immunogenic fragment thereof. The 
SARS viral antigen is preferably present in a concentration amount of 0.1 to 50 fig antigen/dose, 
more preferably 0.3 to 30 fig antigen/dose. Still more preferably, the antigen is about 15 
/Ag/dose. 

In one embodiment, a lower concentration of SARS viral antigen is used in vaccine 
compositions of the invention. Such lower concentration vaccines may optionally comprise an 
adjuvant to boost the host immune response to the antigen. In such a "low dose" vaccine, the 
SARS viral antigen is preferably present in a concentration of less than 15 jtig antigen/dose, (i.e., 
less than 10, 7.5, 5 or 3 /ig antigen/dose. 

The following example illustrates a method of preparing a SARS virus spike (S) protein 
subunit vaccine. 

SARS virus S antigen may be isolated and purified from a variety of sources and using a 
variety of methods, including, but not limited to, S antigen expressed in cultured eukaryotic cells 
(e.g., mammalian cells, such as VERO, CHO) or bacteria (e.g., E. coli). Expression of may be 
achieved by a variety of means, such as, for example, from SARS virus infected cell culture or 
cell culture supematants, from cultured cells stably transformed with a DNA expression cassette 
encoding the SARS virus S protein (e.g., RNA polymerase n promoter operably linked to a 
SARS virus S gene), or from cultured cells infected with a replication-competent or replication- 
incompetent virus-based expression vector (e.g., adenovirus vector, poxvirus vector, alphavirus 
vector, retrovirus vector) encoding the SARS virus S protein, as a means to eliminate the need to 
work with infectious SARS virus. 

1. Subunit SARS Vaccines Produced from SARS Virus Cultures 

The SARS virus may be grown in cultured mammalian celle, such as VERO cells, then 
separated from the cultured cells. A SARS viral antigen, such as the S protein, can then be 
solubilized and separated from the SARS virus, and further isolated and purified. 
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In one example, the S ARS viras may be produced as described in the Inactivated S ARS 
vaccine examples, then the desired SARS antigen, such as spike protein, may be further purified 
from the end product using techniques known in the art. 

In another example, a SARS subunit vaccine may be produced as follows. SARS virus 
may be produced using a desired mammalian cell line on microcarrier beads in large, controlled 
fermentors. For example, vaccine quality African Green Monkey kidney ceUs (VERO ceUs) at a 
concentration of 10^ cells/mL are added to 60 to 75 L of CMRL 1969 media, pH 7.2, in a 150 L 
bioreactor containing 360 g of Cytodex-1 microcarrier beads and stirred for 2 hours. Additional 
CMRL 1969 is added to give a total volume of 150 L. Fetal bovine serum (FBS) is added to a 
final concentration of 3.5%. Glucose is added to a final concentration of 3.0 g/L and glutamine 
is added to a final concentration of 0.6 g/L. Dissolved oxygen, pH, agitation and temperature are 
controlled, and cell growth, glucose, lactate and glutamine levels are monitored. When cells are 
in logarithmic phases usually on days 3 to 4 reached a density of about 1.0-2.5x10^ cells/mL, the 
culture medium is drained from the fermentor and 120 L of CMRL 1969, pH 7.2 (no FBS) is 
added and the culture stirred for 10 minutes. The draining and filling of the fermentor is usuaUy 
repeated once but could be repeated up to three times. After washing the ceUs, the fermentor is 
drained and 50 L of CMRL 1969 containing 0.1% (v/v) FBS is added. The SARS virus 
inoculum is added at a multiplicity of infection (m.o.i.) of 0.001 to 0.01. Trypsin may be added 
to promote efficient infection. Additional CMRL 1969 with 0.1% FBS is added to give a final 
volume of 150 L. Incubation is continued at 34 C. One viral harvest is obtained firom a single 
fermentor lot, typically at 2-7 days post-infection. Multiple harvests fiwm a single fermentation 
may also be obtained. 

The isolation and purification of S protein may be effected by a variety of means, as 
described below. For example, collecting S protein-containing flow-through fix>m ion exchange 
chromatography of solubilized SARS virus envelope proteins; loading the flow through onto a 
hydroxyapatite matrix, and selectively eluting the S protein fi»m the hydroxyapatite matrix. The 
selectively eluted S protein may be further concentrated by tangential flow ultrafiltration. 

Alternatively, the isolation and purification may be effected by coUecting S protein- 
containing flow-through firom ion exchange chromatography of the solubilized SARS virus 
envelope proteins; loading the flow through onto a hydroxyapatite matrix and collecting an S 
protein-containing flow through, selectively removing detergent used in the solubilization step 
from the hydroxyapatite matrix flow through to provide isolated and purified S protein. The 
isolated and purified S protein may be subsequently concentrated by tangential flow 
ultrafiltration 
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Nucleic acid contaminants may be removed from the isolated and purified S protein by 
treatment with a nucleic acid degrading agent as described above in the Inactivation section. 
Preferably, the nucleic acid degrading agent is a nuclease, such as for example, Benzonase. 

The isolated and purified S protein may be apphed to a gel filtration medium and the S 
protein subsequently coUected therefrom to separate the S protein from contaminants of other 
molecular weights. 

Alternatively, the isolation and purification may be effected by loading S protein on a first 
ion-exchange medium while permitting contaminants to pass through the medium, eluting the S 
protein from the first ion-exchange medium, to separate the S protein fix>m contaminants of other 
molecular weights. The eluted S protein is applied to a second ion-exchange medium while 
allowing contaminants to pass through the second ion-exchange medium. ' The S protein is 
subsequently eluted therefrom, to provide the isolated and purified S protein. The eluted S 
protein may be. concentrated by tangential flow ultrafiltration. 

Altematively, substantially pure S ARS virus S protein suitable for use as an immunogen in 
a subunit vaccine formulation may be prepared from infected cell lysates, such as for example 
using a non-denaturing detergent buffer containing 1% Triton X-100 and deoxycholate to lyse 
infected cells. The cell lysates are clarified by centrifugation and S protein is purified from the 
cell lysates by immunoaffinity purification. A monoclonal antibody against the S protein is 
generated and coupled to beads and a colunm is constructed with those beads. SARS-infected 
cell lysates are applied to the colunm, and the column is washed with PBS containing 0.1% 
Triton X-100. Protein bound to the colunm is eluted with O.IM glycine, pH 2.5, 0.1% Triton X- 
100. Elution samples are buffered, such as for example, with Tris, and analyzed for the presence 
of protein. Fractions containing the protein are pooled and dialyzed against PBS 

As discussed above, the present invention includes isolated and purified S protein of SARS 
virus. In one example, the virus is grown on a vaccine quaUty cell line, such as VERO cells, and 
the grown virus is harvested. The virus harvest is filtered and then concentrated typically using 
tangential flow ultrafiltration using a membrane of desired molecular weight cut-off and 
diafiltered. The virus harvest concentrate may be centrifuged and the supernatant discarded. The 
pellet from the centrifugation then is detergent extracted to solubilize the S; protein, for example, 
by resuspending the pellet to the original harvest concentrate volume in an extraction buffer 
containing a detergent such as a non-ionic detergent including TRITON X-100. 

FoUowing centrifugation to remove non-soluble proteins, the S protein extract is purified 
by chromatographic procedures. The extract may first be applied to an ion exchange 
chromatography column such as a TMAE-fractogel or S-fractogel column equilibrated to permit 
the S protein to flow through while impurities are retained on the column. 
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provide a purified solution of the protein. This solution may be concentrated by tangential flow 
ultrafiltration using a membrane of desired molecular weight cut-off and diafiltered. 

Specifically, in one method of S protein purification, the virus harvest concentrate is 
centrifuged at 28,000 x g for 30 minutes at 4 C. The supernatant is discarded and the pellet 

5 resuspended in extraction buffer consisting of 10 mM Tris-HCl, pH 7.0, 150 mM NaCl, 2% 
(w/v) Triton X-100 to the original harvest concentrate volume. Pefabloc is added to a final 
concentration of 5 mM. The suspension is stirred at room temperature for 30 minutes. The 
supernatant, containing the soluble S protein, is clarified by centrifugation at 28,000 x g for 30 
minutes at 4 C. A TMAE-Fractogel column is equilibrated with 10 mM Tris-HCl, pH 7.0, 150 

10 mM NaCl containing 0.02% Triton X-100. The Triton X-100 supernatant, containing the soluble 
S protein, is loaded directly onto the TRAE-Fractogel column. The total volume added plus 2 • 
bed volumes of 10 mM Tris-HCl, pH 7.0, 150 mM NaCl containing 0.02% Triton X-100 are 
collected. The TMAE-Fractogel flow-through containing S protein is diluted 3-fold with 10 
mM Tris-HCl, pH 7.0, containing 0.02% Triton X-100. 

15 An hydroxyapatite column is equilibrated witii 10 niM Tris-HCl, pH 7.0, 50 mM NaCl, 

0.02% Triton X-100. After loading the TMAE flow-through, the column is washed with 2 
column volumes of 10 mM Tris-HCl, pH 7.0, 50 mM NaCl , 0.02% Triton X-100 followed by 4 
column volumes of 5 mM sodimn phosphate, pH 7.0, IM NaCl, 0.02% Triton X-100. The 
proteins are eluted with 4 column volumes of 20 mM sodium phosphate, pH 7.0, IM NaCl , 

20 0.02% Triton X-100. Fractions are collected based on A280 and the protein content and antigen 
concentrations are measured. The purified S protein is ultrafiltered by tangential flow 
ultrafiltration using a 300 kDa NMWL membrane. 
2. Recombinant Production of Subunit SARS Vaccines 

As discussed above, SARS virus proteins may be produced by recombinant expression. 

25 Host cells suitable for recombinant expression include bacterial, manunalian, insect, yeast, etc. 
Recombinant expression may be used to produce a full length SARS protein, a fii^gment thereof, 
or a fusion therewith. 

Fusion peptides may be used to facilitate the expression and purification of the 
recombinant SARS protein. For example, recombinant production of the SARS polypeptides 

30 can be facilitated by the addition a tag protein to the SARS antigen to be expressed as a fusion 
protein comprising the tag protein and the SARS antigen. Such tag proteins can facilitate 
purification, detection and stabiUty of the expressed protein. Tag proteins suitable for use in the 
invention include a polyarginine tag (Arg-tag), polyhistidine tag CElis-tag), FLAG-tag, Strep-tag, 
c-myc-tag, S-tag, cahnodulin-binding peptide, cellulose-binding domain, SBP-tag„ chitin- 

35 binding domain, glutathione S-transferase-tag (GST), maltose-binding protein, transcription 
termination anti-terminiantion factor (NusA), E. coli thioredoxin (TrxA) and protein disulfide 
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isomeraselCDsbA). Preferred tag proteins include His^tag and GST. A fuU discussion on the 
use of tag proteins can be found at Terpe etal, "Overview of tag protein fusions: from 
molecular and biochemical fundamentals to commercial systems", Appl Microbiol Biotechnol 
(2003)60:523-533. 

After purification, the tag proteins may optionally be removed from the expressed ftision 
protein, i.e., by specifically tailored enzymatic treatments known in the art. Commonly used 
proteases include enterokinase, tobacco etch virus (TEV), thrombin, and factor Xa. 

Accordingly, the invention further includes a S ARS virus subunit vaccine comprising a 
fusion protein. Preferably, the fusion protein comprises a first amino acid sequence encoded by 
a SARS virus polynucleotide sequence. SARS virus polynucleotide sequences which may 
encode said first amino acid sequence include one or more of the SARS virus polynucleotide 
sequences identified in this application and fi-agments thereof . 

The fusion protein may comprise an amino acid sequence of a SARS virus protein or a 
firagment thereof. Said SARS virus protein may be selected from one or more of the group 
consisting of the following SARS virus proteins: P28, P65, Nspl, Nsp2 (3CL protease), Nsp3, 
Nsp3, Nsp4, Nsp 5, Nsp6, Nsp 7, Nsp 8, Nsp 9 (RNA polymerase), Nsp 10 (heUcase), Nsp 11,' 
Nsp 12, Nsp 13, Spike, Orf 3, Orf 4, Envelope, Matrix, Orf 7, Oif 8, Orf 9, Orf 10, Orf 11, 
Nucleocapsid and Orf 13. 

In one embodiment, the fusion protein comprises a first amino acid sequence comprising a - 
SARS virus antigen or a firagment thereof. Said SARS virus amino acid sequence may comprise 
one or more of the T-epitope sequences identified above. 

Preferably, the fiision protein comprises an amino acid sequence of a SARS virus spike 
protein, or a fragment thereof. Specific fragments of the spike protein which may be used in the 
fusion protein include the SI domain and the S2 domain. Further fragments of the spike protein 
which may be used in the fusion protein include regions of each of the SI and S2 domains, 
including the receptor binding region of the SI domain, the oligomerization domain regions of 
the S2 domain, the leucine zipper regions of the S2 domain, the membrane anchor region of the 
S2 domain, the hydrophobic domain region of the S2 domain, the cystein-rich domain region of 
the S2 domain, and the cytoplasmic tail region of the S2 domain. (See HGURE 19). Amino 
acid sequences of the Spike protein corresponding to these regions can be identified by those 
skilled in the art. including, for example, using the functional predictions set forth earher in the 
application (predicted transmembrane helices, predicted N-terminus signaling regions, predicted 
coiled-coil regions, etc.) as well as by homology comparison to the sequences of other known 
Coronaviruses (See FIGURES 4F and 5). 

The fiision protein may further comprise a second amino acid sequence. Said second 
amino acid sequence may comprise a polypeptide sequence which facilitates protein expression 
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or puiification, preferably one of the tag sequences discussed above. Alternatively, said second 
amino acid sequence may comprise a second amino acid sequence from a SARS virus. 
Alternatively, said second amino acid sequence may comprises an amino acid sequence firom 
another virus or bacteria, including one or more of the viruses or bacteria identified in Section I, 
below. 

Said second amino acid sequence may comprise an amino acid sequence from another 
respiratory virus. Said second amino acid sequence may comprise an amino add sequence &pm 
a virus selected from the group consisting of coronavirus, influenza virus^ rhinovirus, 
parainfluenza virus (PIV), respiratory syncytial virus (RSV), adenovirus, ^d metapneumoviras. 

In one embodiment, said second amino acid sequence may comprise an amino acid 
sequence from an adjuvant, including one or more of the adjuvants identified in section I, below. 

In one embodiment, the invention includes a fusion protein comprising an amino acid 
sequence of a SARS virus spike protein or a fragment thereof. The fusion protein may further 
comprise a second amino acid sequence comprising an amino acid sequence selected from the 
group consisting of a second SARS virus protein, a non-SARS virus protein, a bacterial protein, 
and an adjuvant. 

( a) Bacterial Expression ofSubunit SARS Vaccines 

In one embodiment, bacterial host cells are used for recombinant expression of SARS virus 
proteins. Bacterial host cells suitable for use in the invention include, for example, E. coli^ 
Bacillus subtilis, and Streptococcus spp. 

The SARS viral protein may be modified to facilitate bacterial recombinant expression. In - 
particular, the SARS spike protein may be modified to facilitate transport of the spike protein to 
the surface of the bacterial host cell. 

Applicants have discovered that there is strong structural homology between the SARS 
virus spike protein and the NadA protein of Neisseria meningitidis. Both proteins have an N- 
terminal globular "head" domain (amino acids 24-87), an intermediate alpha-helix region with 
high propensity to form coiled-coil structures (amino acids 88-350), and a C-terminal membrane 
anchor domain formed by four amphipatix transmembrane beta strands (amino acids 351-405 of 
NadA). Li addition, a leucine zipper motive is present within the coiled-coil segment. See, 
FIGURE 19 depicting the SARS spike protein structure Comanducci et al, "NadA, a Novel 
Vaccine Candidate of Neisseria meningitidis", J. Exp. Med. 195 (11): 1445-1454 (2002). In 
addition, a leucine zipper motif of NadA is present within the coiled-coil segment. The NadA 
protein also forms high molecular weight surface-exposed oligomers (corresponding to three or 
four monomers) anchored to meningococcal outer membrane. 

When the NadA protein is expressed in E. coli, the full-length protein is assembled in 
oligomers anchored to the outer membrane of E. coli, similar to the way the protein is presented 
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in meningococcus. The NadA protein devoid of the predicted membrane anchor domain is then 
secreted into the culture supernatant. This secreted protein is soluble and still organized in 
trimers. 

The invention therefore includes a fusion protein comprising an amino acid sequence of a 
SARS virus spike protein or a fragment thereof and a second amino acid sequence of a bacterial 
adhesion protein or a fragment thereof. Preferably, said adhesion protein is selected from the 
group consisting of NadA, YadA (of enteropathogenic Yersinia), and UspA2 (of Moraxella 
catarrJialis). Additional NadA-Uke proteins include serum resistance protein DsrA of 
Haemophilus ducreyi, the immunoglobulin binding proteins EibA, C, D, and F of E. coli, outer 
membrane protein 100 of Actinobacillus actinoinycetemcomitans, the saa gene carried on the 
large virulence plasmid present in shiga toxigenic strains of coli (STEC), and each of the 
bacterial adhesion proteins described in U.K. Patent Application No. 0315022.4, filed on June 
26, 2003, each of which are specifically incorporated herein by reference. 

Preferably, said adhesion protein comprises NadA or a fragment thereof. 

Such fusion proteins may be used to facilitate recombinant expression of immunogenic 
portions of SARS surface antigens, such as spike. These fusion constructs may also allow the 
SARS SI and/or S2 domains to adapt to a native confirmation. These fusion proteins are also 
able to oligomerize and form dimers or trimers, allowing the SI and/or S2 domains to asisociate 
and adapt conformations as in the native SARS spike protein. Further, these expression 
constructs facilitate surface exposure of the SARS spike protein. 

The fusion proteins of the invention preferably comprise a leader peptide from a NadA like 
protein, preferably NadA, a polypeptide from the immunogenic "head" region of the spike 
protein, and a stalk region from either the NadA like protein or the Spike protein. During 
expression and processing of the fusion protein, one or more amino acids may be cleaved off or 
removed, such as, the leader peptide or a membrane anchor domain. 

The stalk regions facilitate oligomerization of the expression protein. Optionally, the 
fusion proteins of the invention further include an anchor region of a NadA like protein. This 
anchor region allows the expression fusion protein to anchor and assemble on the bacterial cell 
surface. 

The fusion proteins of the invention include the following constructs: 
(i) the NadA leader peptide (optionally also including the first 6 amino acids of the mature 
NadA protein to facilitate processing of the leader peptide and appropriate maturation of the 
protein) followed by the Spike SI domain. Preferably, this construct comprises amino acids 1- 
29 of NadA (corresponding to the NadA leader peptide and the first 6 amino acids of the mature 
NadA protein, as shown in FIGURE 22 and as set forth below) followed by amino acids 14-662 



-127- 



wo 2004/092360 



PCT/US2004/011710 



of a SARS virus Spike protein (corresponding to the SI domain, see RGURE 19 and SEQ ID 
NO: 6042 and as set forth below). Specifically, construct (i) comprises SEQ ID NO: 7302. 

(ii) the NadA leader peptide (optionally also including the first 6 amino acids of the mature 
NadA protein to facilitate processing of the leader peptide and appropriate maturation of the 

5 protein) followed by the Spike SI domain, followed by the stalk and anchor membrane domains 
of NadA. Preferably, this construct comprises amino acids 1-29 of NadA (corresponding to the 
NadA leader peptide and the first 6 amino acids of the mature NadA protein, as shown in 
FIGURE 22 and as set forth below) followed by amino acids 14-662 of a SARS virus Spike 
protein (corresponding to the SI domain, see FIGURE 19 and SEQ ID NO: 6042 and as set forth 

10 below) followed by amino acids 88-405 of NadA (corresponding to the stalk and the anchor 
membrane domains). Specifically, construct (ii) comprises SEQ ID NO: 7303. 

(iii) the NadA leader peptide (optionally also including the first 6 amino acids of the 
mature NadA protein) followed by a SARS virus Spike SI domain, followed by the NadA stalk 
domain. Preferably, this construct comprises amino acids 1-29 of NadA followed by amino 

15 acids 14-662 of a SARS virus Spike protein (corresponding to the SI domain), followed by 
amino acids 88-350 of NadA (corresponding to the stalk domain). Specifically, construct (iii) 
comprises SEQ ID NO: 7304. 

(iv) the NadA leader peptide (optionally also including the first 6 amino acids of the 
mature NadA protein), followed by a SARS virus Spike SI and S2 domain (excluding the 

20 putative transmembrane region), followed by the anchor domain of NadA. Preferably, this 
construct comprises amino acids 1-29 of NadA, followed by amino acids 14-1195 of a SARS 
virus Spike protein (corresponding to SI and S2, excluding the putative transmembrane region),, 
followed by amino acids 351-405 of NadA (corresponding to the NadA anchor domain). 
Specifically, construct (iv) comprises SEQ ID NO: 7305. Alternatively, the NadA anchor 

25 domain may comprise amino acids 332 - 405 of NadA. 

(v) the NadA leader peptide (optionally also including the first 6 amino acids of the mature 
NadA protein), followed by a SARS virus Spike SI and S2 domain (exclusing the putative 
transmembrane region). Preferably, this construct comprises amino acids' 1-29 of NadA, 
followed by amino acids 14-1 195 of a SARS virus Spike protein. Specifically, construct (v) 

30 comprises SEQ ID NO: 7306. 

In each of constructs (i) to (v), the first 23 amino acids are the NadA leader peptide, and 
the GS dipeptide at residues 679-680 arises from the insertion of a restriction enzyme site. 

In constructs (i), (ii) and (iii), the NadA "head" is replaced by the Spike SI domain, and 
the fusion proteins are anchored to the outer membrane of E.coli or secreted in the culture 
35 supernatant, respectively. In constructs (iv) and (v), the "head" and "stalk" domains of NadA are 
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replaced by SI and S2 Spike domains; also in this case, the two fusion proteins are anchored to 
the outer membrane of E. coli or secreted in the culture supernatant, respectively. 

Accordingly, the invention further includes a fusion protein comprising an amino acid 
sequence of a SARS virus spike protein or a fragment thereof and a second amino acid sequence 
5 of a bacterial adhesion protein or a fragment thereof. Preferably, amino acids corresponding to 
the "head" of the adhesion protein are replaced by amino adds corresponding to a SARS virus 
Spike SI domain. Alternatively, the amino acids corresponding to the "head" and "stalk" 
domains of the bacterial adhesion protein are replaced by amino acids corresponding to the 
SARS virus spike protein SI and S2 domains. 

10 As discussed above and shown in Figure 19, the S 1 domain of the Spike protein is 

identified as the globular receptor binding "head" region. The SI domain of the Spike protein 
preferably comprises about amino acids 14-662 of SEQ ID NO: 6042. The SI domain may 
comprise a shorter amino acid sequence, wherein amino acids are removed from either the N- 
terminal or C-terminal regions. Preferably, 3, 5, 7, 9, 13, 15, 20 or 25 amino acids are removed 

15 from either the N-terminal or C-terminal regions. The SI domain further includes amino acid 
sequences having sequence identity to the SI region of SEQ ID NO: 6042. An example of the 
SI domain is SEQ ID NO: 7307: 

As discussed above and shown in Figure 19, the S2 domain of the Spike protein is 
identified as the "stalk" region. The "stalk" region comprises oligomerization domain regions, a 

20 . leucine zipper domain regions, membrane anchor regions, hydrophobic domain regions, cystein- 
rich domain region and a cytoplasmic tail region. The S2 domain of the Spike protein preferably 
excludes the transmembrane region and comprises about amino acids 663-1 195 of SEQ ID NO: 
6042. The S2 domain may comprise a shorter amino acid sequence, wherein amino acids are 
removed from either the N-terminal or C-temiinal regions. Preferably, 3, 5, 7, 9, 13, 15, 20 or 25 

25 amino acids are removed from either the N-terminal or C-terminal regions. The S2 domain 

further includes amino acid sequences having sequence identity to the S2 region of SEQ ID NO: 
6042. An example of the SI domain (with the transmembrane region excluded) is SEQ ID NO: 
7308. 

An example of the NadA protein described above is SEQ ID NO: 7309. As discussed 
30 above, the leader sequence of NadA used in the fusion protein preferably comprises about the 
first 29 amino acids of NadA (including a leader sequence with about 6 amino acids of the NadA 
head protein). Examples of such a leader sequences are set forth as SEQ ID NOS: 7310 and 
7311 below. The fusion protein may use a leader sequence comprising a shorter amino acid 
sequence, wherein amino acids are removed from either the N-terminal or C-terminal regions. 
35 Preferably, 1, 2, 3, 4, or 5 amino acids are removed from either the N-terminal or C-terminal end 
of the sequence. The leader sequence used in the fusion protein may also include an amino acid 
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sequences having sequence identity to SEQ ID NO: 7310 or SEQ ID NO: 7311. Preferably, the 
leader sequence comprises SEQ ID NO: 73 1 1 . 

Optionally, the fusion peptide comprises about the first 6 amino acids of the mature NadA 
protein to facilitate processing of the leader peptide and appropriate maturation of the protein. 
An examples of the first 6 amino acids of a mature NadA proteins is SEQ ID NO: 7312.. 

As discussed above, the stalk and anchor sequences of NadA used in the fusion protein 
preferably comprise about amino acids 88-405 of NadA. An example of an amino acid sequence 
comprising NadA stalk and anchor regions is set forth below as SEQ ID NO: 7313 below. An 
example of an amino acid sequence comprising a NadA stalk region (without the anchor region) 
is set forth as SEQ ID NO: 7314 below. An example of an amino acid sequence comprising a 
NadA anchor region is set forth as SEQ ID NO: 7315 below. The fusion protein may use a stalk 
(and/or anchor) sequence comprising a shorter amino acid sequence, wherein amino acids are 
removed from either the N-terminal or C-terminal regions. Preferably, 1, 2, 3, 4, 5, 6, 7^ 8 or 9 
amino acids are removed from either the N-teratiinal or C-terminal end of the sequence. The 
leader sequence used in the fusion protein may also include an amino acid sequences having 
sequence identity to the SEQ ID NO: 7313. , ' 

The fusion proteins of the invention, including those described above, may be prepared, for 
example, as follows. Single fragments (such as the regions described above) may be amplified 
by PGR using the oligonucleotide primers set forth in the Table below. (S 1l refers to the Spike 
protein fused to the leader peptide of NadA; S2 refers to the staDc region of the Spike protein, 
with and without the stop codon). The oligonucleotides were designed on the basis of the DNA 
sequence of NadA from N. meningitidis B 2996 strain and of Spike from SARS virus isolate 
FRAl. Each oligonucleotide includes a restriction site as a tail in order to direct the cloning into 
the expression vector pET21b. 







SEQ ID NO: 


Restriction i 


SIl 


For 


7316 


Ndel 


SIl 


Rev 


7317 


BamHI 


S2 


For 


7318 


Bamm 


S2 


Rev 


7319 


Hindin 


S2-stop 


Rev 


7320 


Xhol 


NadAgg 


For 


7321 


BamHI 


NadAsso 


Rev 


7322 


Xhol 


NadA332 


For 


7323 


Hindm 


NadAtos 


Rev 


7324 


Xhol 



The single fragments are sequentially cloned into pET21b vector, in order to express the 
proteins under the control of inducible T7 promoter. The SI domain of the Spike protein fused 
to the leader peptide of NadA (SIl) was obtained by PGR using the primers SlL-For and SIl- 
Rev. The forward oligonucleotide primer contains the Ndel restriction sequence and the 
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sequence coding for the leader peptide of NadA plus the first 6 aminoacids of the mature protein. 
The PGR fragment was cloned as a NdeJ/BamHI fragment in the pET21b vector opened with the 
same restriction enzymes. This clone (pET-SIl) was then used to sequentiaUy clone the other 
different domains, as BamHI/XhoI, BamHimindin or Hindm/Xhol fragments. BamHI and 
Hindin restriction sites introduce the aminoacids GS and KL, respectively. 

The PGR amplification protocol was as follows: 200ng of genomic DNA from Neisseria 
meningitidis 2996 or 10 ng of plasmid DNA preparation (plasmid pCMVnew, containing the 
entire gene coding of the Spike protein), were used as template in the presence of 40mM of each 
oligonucletide primer. 400-800 jiM dNTPs solution, Ix PGR buffer (including 1.5mM MgGlz), 
2.5 units TaqI DNA polymerase (using Perkin-Elmer AmpliTaQ or Invitrogen Platinum Pfic 
DNA polymerase). 

After a preKminary 3 minute incubation of the whole mix at 95°C, each sample underwent 
a two-step amplification: the first 5 cycles were performed using the hybridisation temperature 
that excluded the restriction enzyme tail of the primer (Tml). This was foUowed by 30 cycles 
according to the hybridisation temperature calculated for the whole length oligos (Tm2); 
Elongation times, performed at 68°G or 72°G, varied according to the length of the fragment to 
be amplified.The cycles were completed with a 10 minute extension step at SS^C or 72*>G. 

The amplified DNA was either loaded directly on agarose gel and the DNA fragment 
corresponding to the band of correct size was purified from the gel using the Qiagen™ Gel 
Attraction Kit, following the manufacturer's protocol. 

The purified DNA corresponding to the amplified fragment and the plasmid vectors' were 
digested with the appropriate restriction enzymes, purified using the QIAquick™ PGR 
purification kit (following the manufacturer's instructions) and ligation reactions were . 
performed. 

The ligation products were transformed into competent E. coli DH5a and screening for 
recombinant clones was performed by growing randomly-selected colonies and extracting the 
plasmid DNA using the Qiagen QIAprep Spin Miniprep Kit, following the manufacturer's 
instructions. 

Recombinant plasmids were introduced into E. coli BL21(DE3) used as expression host. 
Single recombinant colonies were inoculated into LB + ampicillin and incubated at SV^G for 14- 
16 h. Bacteria were directly recovered by centrifugation (uninduced conditions) or diluted in 
fresh medium and grown at 37°G until ODgoo between 0.4-0.8. Protein expression was induced 
by addition of 1 mM Isopropyl-l-thio-p-D-galactopyranoside (IPTG) for three hours (induced 
conditions). 

Whole cell lysates were obtained resuspending bacteria in SDS-sample buffer IX and 
boiling for 5-10 min. Equal amounts of proteins were separated using NuPAGE (Invitrogen) or 
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BIORAD Gel System, according to the manufacturer's instructions. Proteins were revealed by 
Coomassie-blue staining or transferred onto nitrocellulose membranes for western blot analysis. 
Western blot was performed using a rabbit polyclonal anti-serum against purified NadAA35i-«)s 
(diluted 1;3000) and a secondary peroxidase-conjugate antibody (DAKO). 
5 Results of the expression in E.coli of S 1l, S iL-NadA and S iL-NadAAanchor are shown in 

FIGURES 38 and 39. Schematics of the fusion constructs are shown in FIGURE 37. 

Bacterial expression of the SARS viral antigens may also be used to prepare compositions 
comprising outer membrane vesicles wherein said outer membrane vesicles comprise one or 
more SARS viral antigens. 

10 Outer Membrane Vesicles C*OMV"), also referred to as blebs, refer to vesicles formed or 

derived from fragments of the outer membrane of a Gram negative bacterium. OMVs typically 
comprise outer membrane proteins (OMPs), lipids, phospholipids, periplasmic material and 
lipopolysaccharide (LPS). Gram negative bacteria often shed OMVs during virulent infections 
in a process known as blabbing. OMVs can also be obtained from Gram negative bacteria via a 

15 number of chemical denaturation processes, such as detergent extraction. Synthetic OMVs or 
liposomes, comprising a lipid bilayer and typically enclosing an aqueous core, can also be 
prepared with the SARS viral antigens of the invention. 

The OMVs of the invention are preferably lipid vesicles comprising a lipid bilayer 
surrounding an aquous core. Typically the lipid vesicles are of unilamellar stracture (i.e., a 

10 single lipid bilayer surrounds the aquous core), although multilammelar lipid vesicles may also 
be used in the compositions of the invention. OMVs typically have sizes in the nanomolar to 
micromolar range, e.g., from 1 nM to 100 /tM, more typically from lOnM to 10 jiiM and 
preferably from 30 nM to 1 juM. 

The OMVs of the invention are preferably prepared from gram negative bacteria. Gram 

15 negative bacteria are those bacteria that fail to resist decolorization in the commonly known 
Gram staining method. Gram negative bacteria are characterized by a complex multilater cell 
wall and often possess an outer layer polysaccharide capsule. Gram negative bacteria suitable 
for producing OMVs include, for example, species from Neisseria, Moraxella, Kingella, 
Adnetobacter, Brucella, Bordetella, CKUanydia, Porphyronionas, Actinobacillus, Borelia, 

10 Serratia, Campylobacter, Helicobacter, Haemophilus, Escherichia, Legionella, Salmonella, 
Pseudomonas and Yersinia. 

The OMVs of the invention preferably comprise one or more SARS viral antigens or a 
fragment thereof. The SARS viral antigens may be recombinantly expressed in a Gram negative 
bacterial host cell and then harvested with the OMV. 

5 Antigenic components, such as recombinantly expressed SARS viral antigens, may be 

located in any or all of the three main compartments of the lipid vesicles, including attached to 
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either the interior or exterior surface of the Kpid vesicle, for example via a membrane anchor 
domain, or attachment to a lipid moiety; inserted into the lipid bilayer, for example where the 
antigenic component is itself a hydrophobic or lipid based entity; or located within the aqueous 
center or core of the lipid vesicle. 

Synthetically prepared OMVs, or liposomes, may be used in the invention. Such 
liposomes may comprise a number of different lipids and fatty acids. Suitable Upids for 
inclusion in liposomes of the invention include but are not limited to phophatidylinositoH4,5)- 
diphosphate, phosphatidylserine, phosphatidylchoUne, phosphatidylethanolamine, 
phosphatidyglycerol, cholesterol, beta-oleolyl-gamma-palmitoyl. lipopolysaccharides and 
galactocerbrosides. 

Suitable means for extraction of OMVs from bacterial sources include deoxycholate 
extraction, Tris/HCl/EDTA extraction, and lithium acetate extraction. Preferably, the extraction 
process comprises a physical and/or chemical means to disrupt the bacterial cell outer membrane 
in order to release sufficient OMVs for purification and isolation. See, e.g., WO 03/051379. 

The OMVs of the invention may be enriched and/or supplemented with antigenic 
components, such as SARS viral antigens, by methods known in the art, including, for example, 
direct combination in vitro where an energetic combination step can optionally be applied to 
facilitate integration of the antigenic component into a compartment of the liposome. Methods 
of energetic combination suitable for use in the invention include homogenization, 
ultrasonication, extrusion, and combinations thereof. 

Preferably, the antigenic component, such as the SARS viral antigen, is recombinantly 
produced by the host cell from which the OMV is derived. In one embodiment, such OMVs are 
prepared by intiroducing nucleic acid sequence encoding for the SARS viral antigen into the 
recombinant host cell. Preferably the nucleic acid sequence encoding for the SARS viral antigen 
is controlled by a strong promoter sequence. Preferably, tiie nucleic acid sequence encoding the 
SARS viral antigen further comprises an outer-membrane targeting signal. For example, the 
nucleic acid sequence encoding the SARS viral antigen may be fused to a sequence encoding for 
a naturally occurring outer membrane protein of the bacterial host. Preferably, the nucleic acid 
sequence encoding the SARS viral antigen is fiised to the signal peptide sequence of the 
naturally occurring outer membrane protein of the bacterial host. 

Methods of preparing an optimizing OMVs for use in vaccines are disclosed in, for 
example Filip et al, J. Bact. (1973) 115: 717-722; Davies et al, J. Irmnunol, Method (1990) 
143:215-225; and WO 01/09350. 

In one embodiment, a bacterial host cell, such as E. colt, are tiransfonned to express the 
SARS spike protein. As discussed above, the spike protein may be modified to facilitate 
bacterial expression and disport of the spike protein to the surface of the host cell. Each of the 
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Spike/NadA fusion constructs discussed above may be used in the OMV preparations of the 
invention. Preferably, constructs comprising the spike SI globular head domam fused to the 
stalkregion of NadA are used to generate OMVs. The construct may optionally include the 
NadA leader peptide as well as the NadA anchor peptide. Schematic diagrams of these preferred 
OMV constructs are depicted in FIGURE 49. 

Example 6 describes one method of preparing the OMVs of the invention. 
(b) Mammalian Expression ofSubunit SARS Vaccine 

As discussed above, mammaUan host cells may be used for recombinant ejtpression of 
SARS virus proteins. Mammalian host cells suitable for use in the invention include, for 
example, Chinese hamster ovary (CHO) cells. HeLa cells, baby hamster kidney (BHK) cells, 

monkey kidney ceUs Hep G2), Madin-Darby bovine kidney ("MDBK") cells, as well al 
others. Mammalian sources of cells include, but are not limited to, human or non-human primate 
ie.g., MRC-5 (ATCC CCL-171), WI-38 (ATGC CCL-75). human embryonic kidney cells {293 . 
cells, typically transformed by sheared adenovirus type 5 DNA), VERO cells from monkey 
kidiieys (including, for example COS7 cells), horse, cow {e.g.. MDBK cells), sheep, dog {e.g., 
MDCK cells from dog kidneys, ATCC CCL34 MDCK (NBL2) or MDCK 33016, deposit ' = 
number DSM ACC 2219 as described in WO 97/37001), cat. and rodent {e^g., hamster cells such 
as BHK21-F, HKCC cells, or Chinese hamster ovary cells (CHO cells)), and may be obtained 
from a wide variety of developmental stages, including for example, adult, neonatal, fetal, and 
embryo. 

The polynucleotides encoding the SARS viral proteins may be modified to facilitate or 
enhance expression. For example, conmiercial leader sequences known in the art, such as tPA or 
IgK or interleukin-2, may be used in the recombinant constructs. Preferably, however, the 
natural SARS leader sequence is used. Use of the natural leader sequence can be used to ensure 
that the protein will be trafficked in human cells in the same way as during a normal viral 
infection, which may be advantageous e.g. for DNA vaccines, where antigen is expressed m situ. 

As discussed above, tag sequences can be used in the expression constructs to facilitate 
purification, detection and stability of the expressed protein. Tag proteins suitable for use in the 
invention include a polyarginine tag (Arg-tag), polyhistidine tag (His-tag), FLAG-tag, Strep-tag, 
c-myc-tag, S-tag. calmoduUn-binding peptide, cellulose-binding domain, SBP-tag„ chitin- 
binding domain, glutathione S-transferase-tag (GST), maltose-binding protein, transcription 
termination anti-terminiantion factor (NusA), E. coli thioredoxin (TixA) and protein disulfide 
isomerase I (DsbA). Preferred tag proteins include His-tag and GST. A full discussion on the 
use of tag proteins can be found at Terpe et al., "Overview of tag protein fusions: from 
molecular and biochemical fundamentals to commercial systems", Appl Microbiol Biotechnol 
(2003) 60:523-533. 
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After purification, the tag proteins may optionally be removed from the expressed fusion 
protein, i.e., by specifically tailored enzymatic treatments known in the art. Commonly used 
proteases include enterokinase, tobacco etch virus (TEV), thrombin, and factor Xa. 

One or more amino acid sequences or amino acid domains of the spike protein may be 
removed to facilitate mammaUan recombinant expression. For instance, the entire S2 domain or 
the spike transmembrane region may be removed. Representative examples of some expression 
constiiicts of both fuU length and truncated spike glycoprotein suitable for mammaUan 
expression are shown in FIGURE 40. Polynucleotide sequences representing each constiiict are 
shown in SEQ ID NOS 6578-6583. A description of each annotation is shown below: 
Clone Name Description Expression Constnicf 

nSh natural leader sequence SEQ ID NO: 6578 

fuU length Spike 
histidinetag 

nS natural leader sequence SEQ ID NO: 6579 

fuU length Spike 

nShATC natural leader sequence SEQ ID NO: 6580 

Spike without transmembrane sequence 
histidine tag 

nSATC natural leader sequence SEQ ID NO: 6581 

Spike without transmembrane sequence 
nSlh natural leader sequence SEQ ID NO: 6582 

SI domain 

histidine tag 

nSl natural leader sequence SEQ ID NO: 6583 

SI domain 

Cloned cDNA fragments that encompass full-length Spike coding sequences, as well as a 
Spike construct deleted of the transmembrane and cytoplasmic domains (TM-Cy-deleted Spike) 
for secretion were inserted into an expression vector pCMVm to create nSh and nShATC, 
respectively. Both spike proteins were tagged with six histidine residues at the end of C- 
terminus to aid initial characterization of the expressed spike proteins. Similar sequences 
encoding full-length Spike or tiransmembrane and cytoplasmic domain deleted Spike, but witiiout 
the histidine "tag" are readily substituted by one of skill in the art. 

The likely locations of the expressed spike constiiicts was assessed by separating expressed 
proteins into an aqueous fraction (AF) and a detergent fraction (DF) using the procedure shown 
in Figure 48, with results of western blot analysis shown in Figure 43. The above described 
vector constructs were evaluated for expression after transfection into COS7 cells. The constiuct 
expressing the fuU length spike protein remained in the cell membrane while tiie constinct 
expressing the truncated spike protein was located either in the cytosol (Figure 43) or secreted 
into the cell medium (Figure 44). As shown in Figure 43, full-length spike protein is found in DF 
(membrane) in an aggregated form, while the truncated protein is found in AF (cytosol) as a 
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monomer. As shown in Figure 44, deleted proteins (ShATC) are secreted, and a small fraction of 
fiill-Iength spike protein is detected in the medium by rabbit serum, 

Recombinantly expressed spike proteins may be oligomerized. When the spike proteins 
are to be used in a vaccine or to generate antibodies specific to the spike protein, they are 
preferably oligomerized. In order to obtain oligomerized spike protein, it is preferred to 
maintain the transmembrane domain in the recombinant expression construct. For example, 
FIGURE 41 illustrates a western blot of COS! cell lysates comparing expressed nSh and 
nShATC using both anti-his tag and rabbit anti-SARS antibodies. As shown fiill-Iength (nSh) 
aggregates, but the truncated (nShATC) spike protein does not. Antibody raised against the His- 
tagged protein recognizes fiill-Iength and truncated spike proteins in native and reduced forms. 
Rabbit antiserum recognizes spike protein only in non-reducing conditions. Spike aggregates or 
oligomers were present in larger amounts in the cell lysates from the (expressed nSh constructs. 
Preferably, the oligomerized spike proteins form a homotrimer, as indicated in FIGURE 47 

A further experiment, illustrated in FIGURE 42, demonstrates that the oligomerization of 
the expressed nSh constructs is likely due to a non-covalent linkage (and is likely not due to, for 
example, a disulfide bond). The oligomer dissociates into monomers at elevated temperature 
(SO-IOO^C), but is stable in reducing conditions if not heated. 

It is fiirther preferred that recombinantly expressed spike proteins are glycoslyated. 
Tunicamycin and glycosidases were used to assess glycosylation. FIGURE 45 illustrates that 
glycoslation of expressed spike proteins is not affected by removal of the transmembrane domain 
region. Both full-length (Sh) and truncated (ShATC) S ARS spike proteins ate glycosylated. 

Preferably, expression of the constmcts of the invention is not toxic to the mammalian host 
cell. FIGURE 46 demonstrates that expression of the iUustrated spike constructs is not toxic to 
the COS7 host cell. 

Methods for transfecting, expressing, culturing, isolating and purifying recombinant 
proteins from mammalian cell cultures are known in the art. For example, the S ARS spike 
constructs of the invention may be expressed in 293 cells. These cells may be cultured and 
transfected in static or monolayer cultures. For rapid large-scale production of SARS protein 
antigens in sufficient quantities for in vitro and in vivo evaluation, including immunogenicity 
studies, large-scale transient transfection of 293 (human embryonic kidney) cells may be used to 
obtain milligram quantities of the recombinant antigen(s). Altematively, larger scale transfection 
of these cells may be performed with 293 cells in suspension culture. Preferably, the expressed 
SARS proteins are harvested from the transfected cells between 48 and 72 hours after 
transfection or even from 72 to 96 or more hours after transfection. 

Where the host cells are transfected with truncated spike expression constructs, the 
expressed spike protein is secreted from the host cells and collected from the cell media. After 
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concentration, the spike protein may be purified from the media using, for example, GNA lectin 
followed by DEAE and ceramic hydroxyapatite column chromatography. 

Where the host cells are transfected with full length spike expression constmcts. but rather 
IS retained within the cells, and may be purified from triton X-100 detergent extracted ceUs The 
full-length Spike protein can then be captured on GNA lectin, foUowed by hydroxyapatite and 
SP chromatography. 

Chinese Hamster Ovary (CHO) or other eukaryotic (e.g., mammalian) cells that stably 
express the SARS viral antigens of the invention may also be derived (e.g. Figure 73) " 
Preferably, the ceDs are CHO cells, and these constructs wiU comprise one or more marker or 
selection genes in order to select for the desired CHO cells. In one embodiment, the constiiicts 
compnse a CMV enhancer/promoter, ampicilUn resistance gene, and a fused DHFR and 
attenuated neomycin gene for selection purposes. Stable cell lines can then be produced using 
the neomycin selection system in CHOK-1 cells. Selected clones can then be sequenced to 
venfy tiie integrity of the insert, and ti-ansient transfections can then be perfomied using Trans- 
LTl polyamine transfection reagent (Pan Vera Corp., Madison, WD to assess the expression level 
and also the mtegrity of the expressed protein by EUSA and western blot analysis. 

Methods for derivation of CHO cells stably expressing the SARS viral antigens of the 
mvention comprise the steps of transfection and primary screening witi, selective medimn 
Optionally, these steps are followed by subcloning to assure pmity of cell lines. Cell culture 
supematants can be assayed using an antigen capture ELISA to quantify expression levels at all 
stages of selection and amplification. 

■ For full-lengtii Spike expression constiiicts. methanol fixed cells can be screened for 
internal expression by immunofluorescent staining using a rabbit anti-SARS antibody 
Successive measurements at the T75-flask stage of expansion can be employed to assure stability 
of expression levels. The molecular mass and integrity of tiie expressed proteins can be checked 
by PAGE both under native and reducing and denaturing conditions, followed by 
immunoprobing. 

In one embodiment, the pCMV3 vectors expressing S ARS-CoV Spike proteins in either 
full-length or tiimcated forms is intiroduced into CHOK-1 cells using the Trans-LT-1 reagent 
On day one, IxlO^' cells are plated on 100 mm dishes in non-selective F12 media + 10% Fetal 
Bovine Serum + 4 mM Glutamine. On day two, the cells are transfected with a DNA LT-1 
mixture and the media then replaced with complete F12 media. Twenty-four to forty-eight hours 
later dependmg on the cell density, each 100 mm dish is split to 4-6 100 mm dishes The 
medimn is changed to complete selective media containing Geneticin (neomycin) at 500 /ig/ml 
All bovme senrni used in tiiese procedures is from TS]^free sources that meet current FDA 
standards. Twenty-four hours later tiie medium is changed to complete selective medimn plus 
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500 ug/ml neomycin. Ten to fourteen days later, individual colonies are picked and transferred 
to 96 well plates and cultured in complete selective medium but without G418. When 
approximately 80% of the wells are confluent, twenty-four hour supematants are screened by 
spike capture ELISA positive clones are transferred to twenty-four well plates. For the initial 
expression of full length Spike protein, methanol fixed cells will be screened by 
immunoflourescent staining using a rabbit anti-SARS antibody. After the low expressing cell 
Unes have been eUminated and there are less than 20-30 cell lines, capture EOS A and westems 
will be used to determine the expression level after cell lysis. A portion of each cell line will be 
pelleted, weighed and lysed in 1% triton lysis buffer containing MOPS, NaCl and MgCh at the 
same ratio of cell weight to lysis buffer. After lysis the supernatant is collected and expression 
level is determined. Three to four clones producing the highest levels of spike protein in coirect 
structure and conformation will be grown in three-liter bioreactors for expansion and adaptation 
to low serum suspension culture conditions for scale-up. 

The antigen capture ELISA assay for the SARS spike protein can be performed as 
described in the art. A brief description of this assay follows. 96 well flat-bottom plates = 
(Coming, Coming, NY) are coated with 250ng per well of purified inununoglobulin obtained 
from rabbit sera that were immunized with inactivated SARS virus. Between steps, the plates 
are washed in a buffer containing 16%NaCl and 1% Triton XIOO. lOOjiL of supernatant or 
lysate samples (diluted in a buffer containing lOOmM NaP04, 0.1% Casein, ImM EDTA, 1% 
Triton XIOO, 0.5M NaCl and 0.01% Thiomersal, pH 7.5) are added and incubated for 2 hours at 
37°C. Bound antigen is reacted against pooled SARS+ve serum or high affinity monoclonal 
antibody either human or mouse against SARS spike protein (1 hour incubation, 37°C) and 
detected using appropriate species-specific peroxidase conjugated second antibody (30 minute 
incubation at 37°C; TAGO, Burlingame, CA). The plates are developed for 15 minutes at room 
temperature using TMB substrate (Pierce, Rockford, IL) and the reaction stopped using 4N 
phosphoric acid. The plates are read at a wavelength of 450nm and the concentration of protein 
per ml sample is derived from a standard curve (OD v*. protein concentration) based on serial 
dilutions of a known concentration of recombinant spike protein. 

The immunoprobing analysis can also be performed following the standard methods 
described elsewhere in the art. A brief description follows. 10-20 pil of the sample is analyzed 
on 4-20% SDS PAGE under non-reducing/ denaturing conditions with mild heating. The gels 
are run for 1.5-2.0 hours at 100 V constant voltage. The proteins are then transferred onto 
nitrocellulose membranes (Millipore, Bedford, MA) for 45 min using the semidry western 
transfer system (BioRad, Heaicules, CA) following the manufacturer's instructions. The 
membrane is then reacted against polyclonal anti-spike rabbit serum, followed by anti-rabbit Ig 
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conjugated to Alexa 688 (Molecular Probes. Oregon). The blots are scanned using an inftared 
imaging system OJ-Cor, Inc., Lincoln, Nebraska). 

"ThcWghestexpressingcandidateceniinescanbescreenedforspikeproteinexpression 
and stability in small-scale (3 Hter) suspension cultures. The candidate clone can be iteher 

. evaluated for level of expression as well as integtity of expressed protein after amplification, and 
subsequenfly tested for expression stabihty in the absence of selection. Tl,e selected clones can 
also be tested for maintenance of the DNA sequence integdty of the integnued SARS spike 
protem gene. To quickly monitor the expression levels in smaU flask (T25 or T75) and in the 
three liter evaluation cultures, a lectin-based process (Gluvanthus NivaUs lectin) may be used to 

I '^ol^'te SARS spike protein to a degree ofpmity that allows semi-quantitation and 

characterization of the protein in CHO supernatant. For fiaU-length spike protein it will be 
obtainedflom.ritonX-100detergen.exh:ac.edcells. H-U-length Spike protein will be then 
captured on GNA lectin, followed by hydroxyapatite and SP chromatograph. Eluted protein is 
ften characterized by: 1) polyacrylamide gel electrophoresis (PAGE) and Coomassie staimng 2) 
lmm»,op.obing with anti-SARS rabbit sera, 3) structural characterization using size exclusi™ 
chromatography (SEC), as weU as mass spec analysis using MALDI-TOF 

Routes and methods of immunizaUon of the vaccines of the inventionarediscussedin 
more detaU in a section below Examples 7 to 9 illustrate sample immumzation protocols for the 
recombinant spike proteins. 
Vaccine testing 

Pdor to human admimstration, it is normal to test vaccines in animal models. A mouse 
model ofSARScoronavirusinfectionis known (Subbarao««Z. (2004) y F,V<,n8:3572-77) and 
other animals that may be used as models of infection and/or disease include fmets and ' 
monkeys. Thus the invention provides a non-human animal that is hrfected by the SARS 
c^avirus.whereintheanimalispreferablyafer,e,oraprima.e(..^. a monkey or a macaque) 
The ammal may be gnotobiotic. The animal is preferably not a ca, (Felis domesUcus) Tl,e 
ammal may or may not display SARS disease symptoms e.g. fenets (Mustela furo) show 
promment pulmonary pathology after infection. See: Martina et al. (2003) Nature 425:915. 
g. PolynucleoM^ encodino the ,'MJ?.-; Anfi.,„. ^ f„,„ w„nv,„ 

T J" 'T"™ ---"""g for the SARS antigens of the invend<m 
m addtuon. the invention includes polynucleotides which have been optimized for recombinant 
production (e.g. codon optimizaUon) of fl,e SARS antigens of the invention, including 
polynucleotides encoding for each of the SARS fusion constracts discussed above 
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F. Viral vector or Viral Panicle deliver y of the SARS Antigens of the Invention 

The antigens of the invention may be expressed in vivo or in vitro by polynucleotides 
encoding the antigens. Expression and delivery of the polynucleotides of the invention may be 
facilitated via viral vectors and/or viral particles. 

Gene-based delivery systems derived from viruses, such as alphaviruses, are useful for the 
ex vivo and in vivo administration of heterologous genes, including one or more SARS genes, 
having therapeutic or prophylactic apphcations. These systems can also be used for the 
production of recombinant proteins derived from the SARS virus in cultured cells. Gene-based 
dehvery systems of the invention include viral vectors (e.g., adenovirus vector, poxvirus vector, 
alphavirus vector) and non-viral nucleic acid vectors {e.g., DNA, RNA) encoding one or more 
- SARS virus antigens. Polynucleotides encoding SARS virus antigen(s) are incorporated into the 
gene-based vaccines individually or in combination (e.g., as bicistronic constructs). 
1. Alphavirus 

Alphaviruses are members of togaviridae family and share common stinctural and 
repUcative properties. Sindbis virus (SIN) is tiie prototype virus for the molecular stiidy of other 
alphaviruses, and togetiier with Venezuelan equine encephalitis virus (VEE) and Semliki Forest 
virus (SFV), are the most widely utilized alphaviruses being developed into expression vectors 
for heterologous genes (Schlesinger and Dubensky (1999) Curr Opin. Biotechnol. i0:434-439; 
Schlesinger (2001) Expert Opin. Biol. Ther. 1 : 177-91). 

Alphaviruses possess a relatively small single-sti-anded RNA genome of positive polarity, 
which is approximately 12 kb in length, capped and polyadenylated. The RNA interacts with 
viral capsid protein monomers to form nucleocapsids, which in turn, are surrounded by a host 
cell-derived hpid envelope from which two viral glycoproteins, El and E2, protrade forming 
"spike" tiimers of heterodimeric subunits. Two open reading frames (ORFs) encode as 
polyproteins the enzymatic nonstiucfajral replicase proteins (5' ORE) and the virion structural 
proteins (3' ORE). The structural polyprotein is tiranslated from a highly abundant subgenomic 
mRNA, which is transcribed from a strong internal alphavirus promoter (Strauss and Strauss 
(1994) Microbiol. Rev. 58:491-562). RepHcation of the genome occurs exclusively witiiin die 
host cell cytoplasm as RNA. 

The most conmion alphavirus expression vectors have exploited botii the positive-stranded 
nature and modular organization of die RNA genome. These vectors, termed "replicons" due to 
theu- property of self-amplification, permit insertion of heterologous sequences in place of the 
stiiictural polyprotein genes, while maintaining the 5'- and 3'-end cis replication signals, die 
nonstiucfairal replicase genes, and die subgenomic junction region promoter (Xiong et al. (1989) 
Science 245:1188-1191; Liljestirom (1991) Bio/Technology 9:1356-1361). Chimeric alphavirus 
vectors (and particles) from sequences derived from divergent virus families have also been 
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described, (see. for example United States patent application serial number 09/236 140- see also 
US Patents 5,789.245. 5.842.723. 5.789.245. 5.842.723, and 6,015,694; as well as WO 95/07994 
WO 97/38087 and WO 99/18226). Co-owned International Publication WO 02/099035 
pubHshed December 12. 2002 and incorporated by reference in its entirety herein, describes 
> chimeric alphavirus molecules and modified alphavirus molecules having modified Biosafetv 
Levels. ^ 

The absence of structural protein genes renders alj^avirus replicon vectors defective in 
that RNA amplification and high-level heterologous gene expression occurs within the target' 
ceU, but ceU-to^ell spread of vector is not possible due to the inability to form progeny virions 
I Through the years, several synonymous terms have emerged that are used to describe alphavirus 
repHcon particles. tenns include recombinant vir^ particle, recombinant alphavirus 

particle, alphavirus replicon particle and .eplicon particle. However, as used herein, these terms 
all refer to a virion-like unit containing an alphavirus-derived RNA vector replicon. Moreover 
these tenns may be referred to collectively as vectors, vector constructs or gene delivery vectotL 
Packaging of repUcon RNA into particles can be accomplished by introducing the replicon 
RNA mto pennissiye cells (e.g., RNA or DNA transfection, or particle infectton) that also 
contain one or more structural protein expression cassettes or "defective helper" constructs 
encoding the alphavirus structural proteins. These structural protein encoding constructs may 
themselvesbeintroducedtotothecellsbytransfectionofeitherRNAorDNA andmost 
commonly retain the native alphavirus subgenomlc ptomoter. as well as 5-- and 3--end ds 
signals forco-ampliflcation with therepUcon, but are devoid of any replicase genes and the RNA 
packagmg signal (liljestrom 0.99XyBio/Tec}mology P:1356-1361; Pushko e, al. (1997) Virolo^ 
239:?S9-A01; Polo « al. (1999) PNAS 56:4598-4603). Permanent cell lines that ate stable 
transfomKd with constructs expressing the alphavirus structural proteins (.e.g., packaging cell 
to«)offerameans to avoid transient transfection production methods (P6Io«a/. (1999) iWAS 

The present invention includes compositions and methods for the production of replication 
defective viral vector particles (e.g., alphavirus replicon particles) for use in the evv,v<, and in 
wo administration of heterologous genes encoding proteins having therapeutic or prophylactic 
apphcation. including genes encoding for one or more SARS viral anUgens. 

In one aspect, the invention includes a method of producing replication defective viral 
vector parttcles (e.g.. alphavirus repUcon particles) comprising the steps of introducing at least 
one nucleic acid molecule comprising a viral vector (e.g.. alphavirus repMcon RNA) into 

.mmortalized cells of the present invention, under conditions that aUow for complementarion of 
the viral vector (e.g., alphavirus repHcon RNA) and production of viral vector particles (. g 
alphavirus repBcon particles), and isolating die viral vector particles from the cells or cell culture 
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supematants. In certain embodiments, the irimoitaliMd ceUs are grown in sus^^^ 
example PERC.6 ceUs. In other embodiments, the methods are performed in large-scale 
volmnes, for example, liter volumes or greater, such as for example in roller bottles, large flasks. 
Nunc Cell Factories, Coming Cell Cubes, fermentation vessels, etc). 

In certain embodiments, the viral vector is an alphavirus replicon RNA that requires 
complementation by providing one or more alphavirus structural proteins in trans, within the 
immortalized cell. In such instances, the methods of complementation to produce alphavirus 
replicon particles may involve the introduction of one or more nucleic acids (e.g., RNA, DNA) 
encoding said alphavirus structural protein(s) (e.g., capsid and/or envelope glycoproteins) into 
the immortalized cells, either transiently or stably, and either concurrent with or prior to the 
introduction of the alphavirus repUcon RNA. In certain embodiments, the alphavirus replicon 
RNA is introduced into the cell by transfection an in vitro transcribed RNA. In other 
embodiments, the alphavirus replicon RNA is introduced into the cell by transfection of a DNA 
(e.g., ELVIS), which is capable of transcribing within the cell, the replicon RNA. In yet other 
embodiments, the alphavirus replicon RNA is introduced into the cell by infection with a seed 
stock of alphavims replicon particles. In certain embodiments, the nucleic acids encoding said 
alphavirus structural protein(s) are defective helper RNA or are DNA that can transcribe within 
the cell defective helper RNAs. 

As discussed herein, "alphavirus RNA replicon vector", "RNA replicon vector", "replicon 
vector" or "replicon" refers to an RNA molecule that is capable of directing its own 
amplification or self-replication in vivo, within a target ceU. To direct its own amplification, the 
RNA molecule should encode the polymerase(s) necessary to catalyze RNA amplification (e.g., 
alphavirus nonstmctural proteins nsPl, nsP2, nsP3, nsP4) and also contain cis RNA sequences 
required for replication which are recognized and utilized by the encoded polymerase(s). An 
alphavirus RNA vector replicon should contain the following ordered elements: 5' viral or 
ceUular sequences required for nonstructural protein-mediated amplification (may also be 
referred to as 5' CSE, or 5' cis replication sequence, or 5' viral sequences required in cis for 
replication, or 5' sequence which is capable of initiating transcription of an alphavirus), 
sequences which, when expressed, code for biologically active alphavirus nonstructural proteins 
(e.^., nsPl, nsP2, nsP3, nsP4), and 3' viral or cellular sequences required for nonstructural 
protein-mediated amplification (may also be referred as 3' CSE, or 3' viral sequences required in 
cis for replication, or an alphavirus RNA polymerase recognition sequence). The alphavirus 
RNA vector repUcon also should contain a means to express one or more heterologous 
sequence(s), such as for example, an IRES or a viral (e.g., alphaviral) subgenomic promoter 
(e.g., junction region promoter) which may, in certain embodiments, be modified in order to 
increase or reduce viral transcription of the subgenomic fragment, or to decrease homology with 
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defective helper or structuial protein expression cassettes, and one or more heterologous 
sequence(s) to be expressed. Preferably the heterologous sequence(s) comprises a protein- 
encoding gene, which is the 3' proximal gene within the vector repKcon. And preferably the 
replicon further comprises a polyadenylate tract. 
5 As discussed herein, "recombinant Alphavims Particle", "alphavirus replicon particle" and 

"replicon particle" refers to a virion-like unit containing an alphavims RNA vector replicon. 
GeneraUy. the recombinant alphavims particle comprises one or more alphavims stracti^al 
proteins, a Upid envelope and an RNA vector replicon. Preferably, the recombinant alphavims 
particle contains a nucleocapsid stracture that is contained within a host cell-derived lipid 
10 bUayer. such as a plasma membrane, in which one or more alphaviral envelope glycoproteins 
(e.g., E2, El) are embedded. The particle may also contain other components (e.g., targeting 
elements such as biotin, other viral structural proteins or portions thereof, hybrid envelopes, or ■ 
other receptor binding ligands), which direct the tropism of the particle from which the 
alphavims was derived. Generally the interaction between alphavims RNA and stractural 
15 protein(s) necessary to efficiently fomi a repUcon particle or nucleocapsid may be an RNA- 
protein interaction between a capsid protein and a packagmg signal or packaging sequence 
contained within the RNA. 

"Alphavims packaging cell line" refers to a cell which contains one or more alphavims 
stoictural protein expression cassettes and which produces recombinant alphavims particles 
10 (replicon particles) after introduction of an alphavims RNA vector replicon, eukaryotic layered 
vector initiation system, or recombinant alphavims particle. The parental cell may be of 
mammalian or non-mammaUan origin. Within preferred embodiments, the packaging cell line is 
stably transformed with the stmctural protein expression cassette(s). 

"Defective helper RNA" refers to an RNA molecule that is capable of being amplified and 
15 expressing one or more alphavims stiiictural proteins within a eukaryotic ceU, when that cell also 
contains functional alphavims nonstiiictural "repUcase". proteins. The alphavims nonstmctiiral 
proteins may be expressed within the cell by an alphavims RNA replicon vector or other means. 
To permit amplification and stiiichiral protein expression, mediated by alphavims nonstmctural 
proteins, tiie defective helper RNA molecule should contain 5'-end and 3'-end RNA sequences 
.0 required for amplification, which are recognized and utilized by the nonstmctural proteins, as 
weU as a means to express one or more alphavims stiiictural proteins. Thus, an alphavims 
defective helper RNA should contain the following ordered elements: 5' viral or cellular 
sequences required for RNA amplification by alphavims nonstiructural proteins (also referred to 
elsewhere as 5' CSE, or 5' cis replication sequence, or 5' viral sequences required in cis for 
5 repUcation, or 5' sequence which is capable of initiating transcription of an alphavims), a means 
to express one or more alphavims stiiictural proteins, gene sequence(s) which, when expressed, 
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codes for one or more alphavims structural proteins {e,g., C, E2, El), 3' viral or cellular 
sequences required for amplification by alphavims nonstructural proteins (also refeired to as 3' 
CSE, or 3' viral sequences required in cis for replication, or an alphavims RNA polymerase 
recognition sequence), and a preferably a polyadenylate tract. Generally, the defective helper 
RNA should not itself encode or express in their entirety all four alphavims nonstmctural 
proteins (nsPl, nsP2, nsP3, nsP4), but may encode or express a subset of these proteins or 
portions thereof, or contain sequence(s) derived jfrom one or more nonstructural protein genes, 
but which by the nature of their inclusion in the defective helper do not express nonstmctural ' 
protein(s) or portions thereof. As a means to express alphavims stmctural protein(s), the 
defective helper RNA may contain a viral {e.g., alphaviral) subgenomic promoter which may, in 
certain embodiments, be modified to modulate transcription of the subgenomic fragment, or to 
decrease homology with replicon RNA, or alternatively some other means to effect expression of 
the alphavims stmctural protein {e.g., internal ribosome entry site, ribosomal readthrough 
element). Preferably an alphavims structural protein gene is the 3' proximal gene within the 
defective helper. In addition, it is also preferable that the defective helper RNA does not contain . 
sequences that facilitate RNA-protein interactions witii alphavims" stmctural protein(s) and 
packaging into nucleocapsids, virion-like particles or alphavims replicon particles. A defective 
helper RNA is one specific embodiment of an alphavims stmctural protein expression cassette. 

Alphavims for use in the invention may be grown in any one of the cell lines discussed 
above as suitable for the SARS vims. 

Alphavims repUcon particles may be produced according to the present invention by using 
the above cell lines {e.g., immortalized cell lines) and a variety of published and accepted 
alphavims vector methodologies. Such methodologies include, for example, tiransient packaging 
approaches, such as the co-tiransfection of in vitro tiranscribed replicon and defective helper 
RNA(s)(Liljest:om,S/o/rgc/ino/ogy 9:1356-1361, 1991; Bredenbeek era/.. /. Virol 67:6439- 
6446, 1993; Frolov et al, J. Virol 7i :2819-2829, 1997; Pushko et al. Virology 259:389-401, 
1997; US Patents 5,789,245 and 5,842,723) or co-transfection of plasmid DNA-based repHcon 
and defective helper constiuct(s) (Dubensky etcd., J. Virol 70:508-519, 1996), as weD as 
introduction of alphavims stiiictiiral protein expression cassettes {e.g., DNA-based defective 
helper) into immortalized cell lines of the present invention to create stable packaging cell Unes 
(PCL) (Polo etal, PNAS 96'A59^A6Q3, 1999; US Patents 5,789,245, 5.842,723, 6,015,694; WO 
97/38087, WO 99/18226, WO 00/61772, and WO 00/39318). Stable packaging cell lines may 
then be utiKzed for alphavims repUcon particle production. The PCL may be ti-ansfected with in 
vitro ti-anscribed alphavims replicon RNA, transfected with a plasmid DNA-based replicon (e.g., 
ELVIS vector), or infected with a seed stock of alphavirus replicon particles, and then incubated 
under conditions and for a time sufficient to produce progeny alphavims replicon particles in the 
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culture supernatant. In addition, progeny replicon particles can subsequently be passaged in 
additional cultures of naiVe PCL by infection, resulting in further expansion and commercial 
scale preparations. Importantly, by using defective helper RNA or stable PCL based on the 
"split" structural gene configuration, these replicon particle stocks may be produced free from 
detectable contaminating RCV. 

Following harvest, crude culture supematants containing the chimeric alphavirus repUcon 
particles may be clarified by passing the harvest through a filter (e.g.. 0.2 uM. 0.45 uM. 0.65 uM. 
0.8 uM pore size). Optionally, the crude supematants may be subjected to low speed 
centrifugation prior to filtration to remove large ceU debris. Within one embodiment, an 
endonuclease ^e.g., Benzonase, Sigma #E8263) is added to the preparation of alphavirus replicon 
particles before or after a chromatographic purification step to digest exogenous nucleic acid. 
Further, the preparation may be concentrated prior to purification using one of any widely known 
methods ie.g., tangential flow filtration). Crude or clarified alphavirus replicon particles may be 
concentrated and purified by chromatographic techniques (e.g., ion exchange chromatography ■ 
size exclusion chromatography, hydrophobic interaction chromatography, affinity 
chromatography), such as those described in WOOl/92552, incorporated by reference in its 
entirety herein. Two or more such purification methods may be performed sequentiaUy 

EXAMPLE OF ALPHAVIRUS REPLICON PARTICLES ENCODING SARS VIRUS SPIKE (S) 
ANTIGEN 

The invention includes compositions and methods for the production of replication 
defective viral vector particles (e.g., alphavirus replicon particles) for use in the ex vivo and in 
VIVO administration of heterologous genes encoding proteins having therapeutic or prophylactic 
appHcation, including genes encoding for one or more SARS viral antigens. 

The following example illustrates a method of preparing alphavirus repUcon particles 
encoding SARS virus spike (s) antigen. 

The SARS virus spike gene can be incorporated into alphavims replicon particles derived 
from a variety of alphavims, such as Sindbis virus. Semliki Forest virus (US 5739026) 
Venezuelan equine encephalitis vims (US 6531135). and replicon particle chimeras derived from 
more than one alphavims (US 6376236. WO 02/99035). In addition, the SARS vims spike gene 
can be incorporated in its entirety (encoding full-length spike protein) or in a modified form that 
includes, for example, sequence deletions or tmncations, such that the encoded a spike protein is 
of less than full-length (e.g. C-terminal tmncation of one or more (e.g. at least 1. 2. 3, 4, 5. 6, 7. 
8. 9, 10. 15, 20, 25, 30 etc.) amino acids, deleted of transmembrane region and c^tJpllsmic til). 

For example, the spike gene may be cloned as a full-length gene into the VCR-chim2 1 
vector (WO 02/99035) by standard RT-PCR conditions or by standard subcloning from one of 
the other plasmids described herein, using commercially available restriction endonucleases. For 
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the reverse transcription step in standard RT-PCR, the Superscript pre-ampUfication kit 
(Invitrogen™) and the primer SEQ ID NO: 7325 (sp-RT-R) are used: 

For the amplification step, the cDNA polymerase advantage kit (Clonetech) and two 
primers Sp-F-BbvCI (SEQ ID NO: 7326) and Sp-R-Nofl (SEQ ID NO: 7327) are used: 

The forward primer is designed to contain the ccacc sequence (Kozak, 1991 JBC 19867- 
70) in front of the ATG codon to optimize translation efficiency of the spike gene. Also, the 
forward primer contains the BbvCI restriction site and the reverse primer contains the NotI 
restriction site for subsequent cloning of the PGR ampUfied gene. 

The PGR product is purified using the QIAquick Nucleotide Removal kit (QIAgen), 
digested with BbvGI and NotI, gel purified with QIAquick Gel Extraction kit (QIAgen), and 
ligated to plasmid VGR-Ghim2.1 pre-digested with the same enzymes. Glones containing the 
SARS spike sequence are verified by sequencing and the new construct is called VGR-Ghim2.1- 
SARSspike. 

To generate VEErep/SINenv-SARSspike replicon particles the plasmids VGR-Ghim2.1- 
SARSspike, VCR-DH-Scap (WO 02/99035). and VCR-DH-Sglydll60 (WO 02/99035) are 
linearized with the restriction enzyme Pmel and used for in vitro transcription as described 
previously (Polo et al 1999. PNAS 96: 4598-603; WO02/99035). The transcripts are c6- 
transfected into BHK cells as previously described (Polo et al., 1999, ibid.; WO02/99035). The 
transfected cells are incubated at 34 °G, the supematants collected at 20 and 30 hrs post- 
elecfroporation, clarified by centrifugation, and purified by chromatography as previously 
described (WO 01/92552). 

Expression of the SARS spike protein fijom the replicon particle vector is verified by 
infecting BHK ceUs overnight with purified VEErep/SINenv-SARSspike or VEErep/SINenv- 
GFP (WO 02/99035) replicon particles. In addition, BHK cells also were transfected in parallel 
with in vitro transcribed VCR-Chim2.1-SARSspike replicon RNA. At 16 hrs post-infection and 
transfection cells are lysed and a sample of the lysate analyzed by western blot using an antibody 
that recognizes SARS virus spike protein. The proteins on the gel are stained or transferred to a 
membrane for Westem blot analysis with sera fix)m convalescent patients or altematively murine 
or rabbit antisera generated against SARS vims. VEErep/SINenv-SARSspike replicon particles 
are administered to the vaccine recipient (e.g., rodent, non-human primate, human) as described 
elsewhere in the present invention. 

Figure 67 shows data from westem blot analysis performed under non-reducing conditions, 
using a SARS virus specific rabbit polyclonal antisera. The westem data demonstrate that not 
only is SARS spike protein expressed in cells infected with alphavirus replicon particles or 
transfected with replicon RNA, but the predominant form of spike is that of a homotrimer 
(Fig.67A). Similar homotrimeric association of the spike protein was observed in westem blots 
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of SARS Virions purijBed from SARS virus infected VERO ceD supematants, and this 
homotrimer is heat labile, as indicated by the dissociation into monomeric forms at 80°C and 
100°C (Fig.67B). 

To further characterize SARS Spike protein expression and processing following 
expression from alphavirus replicon vectors, BHK-21 cells were infected with alphavirus 
replicon particles expressing the fuU-length Spike. At 6 hr post-infection with an MOI of 5, 
infected cells were labeled for 1 hr with L-['^S]methionine/cysteine and chased for the indicated 
time. The [^^S]-labeled spike protein was inmiunoprecipitated by anti-SARS rabbit serum and 
digested with Endo-H. Both digested and undigested proteins were analysed by 4% 
polyacrylamide-SDS PAGE under reducing conditions. As shown in Figure 55, the full-length 
spike protein is synthesized as an Endo-H sensitive high mannoSe glycoprotein (gpl70, an ER 
form) that undergoes modification to an Endo-H resistant glycoprotein with complex 
oligosaccharides (gpl80, a Golgi form). The conversion of gpl70 into the gpl80 form takes 
place within 2 hr. 

Alphavirus replicon particles expressing one or more SARS proteins (e.g., 
VEErep/SINenv-SARSspike repUcon particles) are administered to the vaccine recipient in order 
to induce a SARS specific immune response (e.g., rodent, ferret, non-human primate, human) as 
described elsewhere in the present invention. Immunization may be performed through a variety 
of routes, including for example, intramuscular, subcutaneous, intradermal, and intranasal. In . 
additon, the alphavirus replicon particles may be used alone or in combination (e.g., "prime- 
boost") with other vaccine approaches of the present invention, or alternatively the alphavirus 
replicon particles may co-express antigen from other respiratory pathogens or be co-administered 
in combination with alphavirus replicon particles expressing antigens from other respiratory 
pathogens (e.^:,. influenza virus, parainfluenza virus, respiratory syncytial virus, human 
metapneumovirus). For example, the induction of anti-spike protein antibodies in animals 
immunized IM with VEErep/SINenv-SARSspike repUcon particles was demonstrated in mice 
(Figure 68). These mouse studies also included addtional vaccine groups for comparison, 
including the inactivated SARS virus and recombinant truncated spike protein vaccines describe 
elsewhere herein, as well as plasmid DNA used as a prime, followed by alphavirus repHcon 
particles as a boost. The data clearly show very potent inmiune responses for all vaccine groups, 
including the alphavirus replicon particle group. It should be noted that the level of antibody 
induced by the inactivated SARS virus vaccine used in these experiments has been shown to be 
protective in a SARS virus animal challenge model. 

Similarly, genes encoding other SARS virus antigens (e.g., nucleocapsid protein, 
membrane glycoprotein) are cloned into alphavinis replicon vectors, either individuaUy or in 
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combination, to generate alphavirus replicon particles according to the teachings of the present 
invention and using standard molecular biology techniques.. 

EXAMPLE OFALPHAVmUS-BASED PLASMID DNA EXPRESSING SARS VIRUS SPIKE 

(S) 

The invention includes preparation of plasmid DNA expressing a SARS virus antigen for 
prophylactic or therapeutic immunization against SARS virus infection. In one embodiment, the 
SARS viral antigen is a spike (S) protein. In one embodiment, the plasmid DNA is alphavinls- 
based. 

The following example illustrates one method for preparing an alphavims-based plasmid 
DNA expressing SARS virus spike (S). 

SARS spike gene can be dehvered using any of the alphavirus-based plasmid DNA 
replicons such as ELVS (Dubensky et al, 1996 J Virol. 70: 508-19), SINCP (WO 01/8160?) or 
VCP (PCT WO 02/99035). 

For example, the SARS spike gene is cloned into SBSTCP using the standard RT-PCR 
techniques. The oligo Sp-RT-R is used for the reverse transcription step with the Superscript pre- 
amplification kit (Invitrogen). For the amplification step, the cDNA polymerase advantage kit 
(Clonetech) with the Sp-R-Nofl and Sp-F-Xhol (SEQ ID NO: 7328) primers is used. . 

The Sp-F-Xhol primer was designed to contain the ccacc sequence in front of the ATG 
codon to optimize translation efficiency (Kozak 1991, ibid) of the spike gene. Also, the primer 
contains the Xhol restriction site for the subsequent cloning of the PGR amplified gene. 

The PGR product is purified using the QIAquick Nucleotide removal kit, digested with 
Xhol and Noa, gel purified with QIAquick Gel Extraction kit, and ligated to plasmid SINCP pre- 
digested with the same enzymes. Clones containing the SARS spike sequence are verified by 
sequencing and the new construct is called SINCP-SARSspike. 

Expression of the SARS spike gene is verified by transient transfection of BHK cells with 
2^g of either plasmid DNA SINCP-SARSspike or SINCP pre-incubated for 5 minutes with 5 jil 
of Transrr Polyamiiie reagent (Miirus) in low serum medium Optimem (Life Technologies). At 
48 hrs pos-transfection ceUs are lysed and a sample of the lysate is ran on 8% SDS-PAGE. The 
proteins on the gel are either stained or transferred to a membrane for Western blot analysis with 
sera from convalescent patients, or alternatively with sera fi-om mouse or rabbits. 

SINCP-SARSspike plasmid replicon is administered to the vaccine recipient (e.g., rodent, 
non-human primate, human) as a formulated or unformulated plasmid vaccine, alone or in 
combination (e.g., "prime-boost") with other vaccines of the present invention, as described 
elsewhere herein. 

Similarly, genes encoding other SARS virus antigens (e.g.. nucleocapsid protein, 
membrane glycoprotein) are cloned into alphavirus plasmid replicon vectors. 

-148- 



10 



to 



WP 2004/092360 PCT/US2004/011710 
2. Plasmid Expression Vectors 

EXAMPLE OF PLASMID DNA EXPRESSING SARS VIRUS SPIKE (S) 

The following example illustrates a method for preparing plasmid DNA expressing SARS 
virus spike (s). 

The SARS virus spike antigen also may be delivered using other plasmid DNA expression 
vectors (sometimes referred to as "conventional" DNA vaccines), based on a polymerase H 
promoter, such as, for example, a CMV promoter. A DNA vaccine of the spike antigen gene _ 
induces an antibody response in mice (Zhao et al. (2004) Acta Biochun et Biophysica Sinica 
36:37-41), and has been found to induce viral neutraHzation and protective immunity in mice 
(Yang et al. (2004) Nature 428:561-564), particularly when truncated at the C-tenninus. 

For example, the SARS spike gene is cloned into pCMVKni2 (Zur Megede et al, J.Virol., 
74:2628-2635, 2000; SEQ ID NO: 9923) using standard RT-PCR techniques. The oligo Sp-RT-' 
R is used for the reverse transcription step with the Superscript pre-amplification kit 
(Invitrogen). For the amplification step, the cDNA polymerase advantage kit (Clonetech) is.used 
with primers Sp-F-EcoRI (SEQ ID NO: 7329) and Sp-R-Xbal (SEQ ID NO: 7330). 

The forward primer was designed to contain the CCACC (SEQ ID NO: 7331) sequence in 
front of the ATG codon to optimize translation efficiency (Kozak 1991, ibid.) of the spike gene. 
Also, the forward primer contains the EcoRI restriction site and the reverse primer contains the 
Xbal restriction site for the subsequent cloning of the PGR amplified gene. 

The PGR product is purified using the QIAquick Nucleotide Removal kit, digested with 
Xhol and Nofl, gel purified with QIAquick Gel Extraction kit, and ligated to plasmid 
pGMVKm2 pre-digested with the same enzymes. Glones containing the SARS spike sequence 
are verified by sequencing and the new construct is called pGMVKm2-SARSspike, 

Expression of the SARS spike gene is verified by transient transfection of BHK or 293 
cells with 2jig of either plasmid DNA pGMVKm2-SARSspike or pGMVKm2 pre-incubated for 
5 minutes with 5 of TransIT Polyamine reagent (Mirrus) in low serum medium Optimem (life 
Technologies). At 48 hrs pos-transfection cells are lysed and a sample of the lysate is run on 8 % 
SDS-PAGE. The proteins on the gel are either stained or transfeixed to a membrane for Western 
blot analysis with sera firom convalescent patients, or alternatively using mouse or rabbit 
antisera. 

Plasmid pGMVKm2-SARSspike is administered to the vaccine recipient (e.g., rodent, non- 
human primate, human) as a formulated or unformulated plasmid vaccine, as described 
elsewhere in the present invention. 

Similarly, genes encoding other SARS virus antigens (e.g., nucleocapsid protein, 
membrane glycoprotein) are cloned into plasmid expression vectors 
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3. Virus-Like Particles comprisin g SARS antig ens 

The SARS viral antigens of the invention may be fonnulated into Vmis like Particles 
('•VLPs"). The invention thus includes virus-like particles (or VUPs) comprising one or more 
SARS viral antigens. Preferably, the VIPs comprise one or more SARS yiral antigens selected 
ftom the group consisting of Spike (S), nucleocapsid (N). membrane (M) and envelope (E). 
Preferably, the VLPs comprise at least M and E. 

The VIPs of the invention comprise at least one particle-foiming polypeptide. Said 
particle-forming polypeptide is preferably selected from a Coronavirus structural protein." In one 
embodiment, the particle-forming polypeptide is selected from one or more SARS viral antigens 
In another embodiment, the particle-forming polypeptide is selected from the structural protein • 
of a non-SARS Coronavirus. such as. for example. Mouse Hepatitis Virus. 

VLPs can be formed when viral structural proteins are expressed in eukaryotic or 
prokaryotic expression systems. Upon expression, the structural proteins self-assemble to form 
particles. Alternatively, viral stmctural proteins may be isolated from whole virus and • 
fonnulated with phospholipids. Such viral stmcuiral proteins are referred to herein as "particle- 
forming polypeptides". VLPs are not infectious because no viral genome is present, however, 
these non-rephcating, virus capsids mimic the structure of native virions. 

Due to their structure. VLPs can display a large number of antigenic sites on their surface 
(similar to a native vims). VLPs offer an advantage to live or attenuated vaccines in that they are 
much safer to both produce and administer, since they are not infectious. VLPs have been 
shown to induce both neutralizing antibodies as well as T-ceU responses and can be presented by 
both class I and n MHC pathways. 

Previous work creating VLPs from coronavims indicates that E and M proteins along may 
be sufficient for coronavirus VLP formation. See Fischer et aL, J. Virol. (1998) 72:7885-7894 
and Vennema er al. EMBO J. (1996) 15:2020-2028. 

Chimeric VLPs comprising particle-forming polypeptides or portions thereof ftom non- 
SARS Coronavimses are also included in the invention. Such particle-forming polypeptides may 
compnse a full length polypeptide from a non-SARS Coronavims. Alternatively, a particle- 
forming fragment may be used. 

In one embodiment, a fragment of a non-SARS particle-fonning polypeptide and a 
fragment of a SARS viral antigen are fused together. For instance, such chimeric polypeptides 
may comprise the the endodomain and transmembrane domain of a non-SARS particle-fonning 
polypeptide and the ectodomain of a SARS viral antigen. In one example, the VLPs of the 
invention comprise a chimeric spike protein comprising an endodomain and transmembrane 
domain of the spike protein of Mouse Hepatitis Vims (MHV) and the chimeric spike protein 
further comprises the ectodomain of the SARS spike protein. Such VLPs may further comprise 
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Coronavirus M and E proteins. Said M and E proteins may be selected from any coronavirus 
mcluding Mouse Hepatitis Virus (MHV) or SARS. Sample sequences of S, M and E proteins' of 
MHV are mcluded in the figures, supra. 

Chimeric spike proteins derived from the ectodomain of feline infectious peritonitis virus 
> (FIPV) spike protein fused to the endo and tiransmembrane domains of MHV spike protein have 
been previously disclosed. See WO 98/49195 and WO 02/092827. In these chimeric VLP 
stiructures, the capsid stinicture of the VLPs is formed by the M and E protein of MHV The 
chimeric spike protein provides for the surface exposure of the ectodomain of the FIPV spike 
protein. 

As used herein, the term "vims-like particle" or "VLP" refers to a non-replicating empty 
vmis shell. VUPs are generaUy composed of one or more viral proteins, such as, but not limited 
to those proteins referred to as capsid, coat, shell, surface and/or envelope proteins, or particle- 
fonrnng polypeptides derived from these proteins. VLPs can form spontaneously upon 
recombinant expressionof the protein in an approrpirateexpression system. Alternatively viral 
stiiictural proteins may be isolated from whole virus and formulated with phosphoUpids 
Methods for producing particular VLPs are known in the art and discussed more fully below 
The presence of VLPs in a composition can be detected using conventional techniques known in 
tiie art. such as by electron microscopy, x-ray crystallography, and the like. See, e.^. Baker 
al, Biophys, J. (1991) 60:1445-1456; Hagensee et al., J. Virol (1994) 68:4503-4505 For 
example, cryoelectron microscopy can be performed on vitrified aqueous samples of the VLP ■ 
preparation in question, and images recorded under appropriate exposure conditions. 

The phrase "particle-forming polypeptide" includes a full-length or near full-length viral 
protem, as well as a fi-agment thereof, or a viral protein with internal deletion, which has the 
ability to form VLPs under conditions that favor VLP fomiation. Accordingly, the polypeptide 
may comprise the full-length sequence, fragments, tmncated and partial sequences, as well as 
analogs and precm-sor forms of the reference molecule. The term therefore includes deletions 
additions and substitutions to the sequence, so long as the polypeptide retains tiie ability to form 
a VLP. Thus, the term includes natural variations of the specified polypeptide since variations in 
coat proteins often occur between viral isolates. The term also includes deletions, addition and 
substitutions that do not naturally occur in the reference protein, so long as the protein retains the 
ability to fonn a VLP. 

Inferred substitutions are those which are conservative in nature, i.e., those substitutions 
that take place within a family of annno acids that are related in their side chains. Specifically 
amino acids are generally divided into four famiUes: (1) acidic: aspartate and glutamate- (2) ' 
basic: lysine, arginine, and histidine; (3) non-polan alani™,. vaHne, leucine, isoleucine. i^Kne 
phenylalanine, methiomne, tryptophan; and (4) uncharged polar: glycine, asparagine. glutaminJ, 
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cystine, serine, theionine, tyrosine. Phenylalanine, tryptophan, and tyrosine are sometimes 
classified as aromatic amino acids. For example, it is reasonably predictable that an isolated 
replacement of leucine with isoleucine or valine, an asparate with a glutamate, a threonine with a 
serine, or a similar conservative replacement of an amino acid with a structurally related amino 
acid, will not have a major effect on the biological activity. Proteins having substantially the 
same amino acid sequence as the reference molecule, but possessing minor amino acid 
substitutions that do not substantially affect the immunogenicity of the protein, are therefore 
within the definition of the reference polypeptide. 

The VLPs of the invention can be formed from any viral protein, particle-forming 
polypeptide derived from the viral protein, or combination of viral proteins or fragments thereof, 
that have the capability of forming particles under appropriate conditions. The requirements for 
the particle-forming viral proteins are that if the particle is formed in the cytoplasm of the host 
cell, the protein must be sufficientiy stable in the host cell in which it is expressed such that 
formation of virus-like structures will result, arid that the polypeptide will automatically 
assemble into a virus-like structure in the cell of the recombinant expression system used. If the 
protein is secreted into culture media, conditions can be adjusted such that VLPs will form. 
Furthermore, the particle-forming protein should not be cytotoxic in the expression host and 
should not be able to replicate in the host in which the VLP will be used. 

Preferred particle-forming polypeptides include coronavirus M and E proteins, preferably 
SARS M and E proteins. 

Methods and suitable conditions for forming particles from a wide variety of viral proteins 
are known in the art. VLPs have been produced, for example from proteins derived from 
influenza virus (such as HA or NA), Hepatitis B virus (such as core or capsid proteins). Hepatitis 
E virus, measles virus, Sindbis virus. Rotavirus, Foot-and-Mouth Disease virus. Retrovirus, 
Norwalk virus, human Papilloma virus, HIV, RNA-phages, QB-phage (such as coat proteins), 
GA-phage, fr-phage, AP205 phage, and Ty (such as retirotiransposon Ty protein pi). VLPs are 
discussed further in WO 03/024480, WO 03/024481, and Niikura et al.. Virology (2002) 
293:273-280; Lenz et a/., J. Immunology (2001) 5246-5355; Pinto, et al., J, Infectious Diseases 
(2003) 188:327-338; and Gerber et al. , /. Virology (2001) 75(10):4752-4760. 

As explained above, VLPs can spontaneously form when the particle-forming polypeptide 
of interest is recombinantly expressed in an appropriate host cell. Thus, the VLPs for use in the 
present invention may be prepared using recombinant techniques, well known in the art. In this 
regard, genes encoding the particle-forming polypeptide in question can be isolated from DNA 
libraries or directly from cells and tissues containing the same, using known techniques. The 
genes encoding the particle-forming polypeptides can also be produced synthetically, based on 
the known sequences. The nucleotide sequence can be designed with the appropriate codons for 
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the particular amino sequence desired. In general, one will select preferred codons for the 
intended host in which the sequence will be expressed (e.g. human codons for human DNA 
vaccines). The complete sequence is generally assembled from overlapping oligonucleotides 
prepared by standard methods and assembled into a complete coding sequence. See., e.g.. Edge, 
5 Nature (1981) 292:756; Nambair et at Science (1984) 223:1299; Jay et al, J. Biol. Ckem. (1984) 
259:6311. 

Once the coding sequences for the desired particle-forming polypeptides have been 
isolated or synthesized, they can be cloned into any suitable vector or replicon for expression. 
Numerous cloning vectors are known to those of skill in the art, and the selection of an 

10 appropriate cloning vector is a matter of choice. See, generally, Sambrook al. The vector is 
then used to transform an appropriate host cell. Suitable expression systems include, but are not 
limited to, bacterial, mammalian, bacuolvirus/insect, vaccinia, Semliki Forest virus (SFV), yeast, 
and Xenopus expression systems, well known in the art. 

A number of cell lines suitable for use as host cells for producing the VLPs of the 

15 invention are known in the art. Suitable mammalian cell lines include, but are not limited to, 
Chinese Hamster Ovary (CHO) cells, HeLa cells, baby hamster kidney (BHK) ceUs, monkey 
kidney cells (COS), human hepatocellular carcinoma cells (e.g. , Hep G2), Madin-Darby bovine 
kidney ("MDBK") ceUs, as well as others. Mammalian sources of ceUs include, but are not 
limited to, human or non-human primate (e.g., MRC-5 (ATCC CCI^171), WI-38 (ATCC CCL- 

20 75), HUH, human embryonic kidney cells (293 cells, typically transformed by sheared 

adenovirus type 5 DNA), VERO cells from monkey kidneys (including, for example COS7 
cells), horse, cow (e.g.. MDBK cells), sheep, dog (e.g., MDCK cells from dog kidneys, ATCC 
CCL34 MDCK (NBL2) or MDCK 33016, deposit number DSM ACC 2219 as described in WO 
97/37001), cat, and rodent (e.g., hamster ceUs such as BHBCZl-F, HKCC cells, or Chinese 

25 hamster ovary cells (CHO cells)), and may be obtained from a wide variety of developmental 
stages, including for example, adult, neonatal, fetal, and embryo. 

Bacterial hosts suitable for production of VLPs of the invention include E. coli. Bacillus 
subtilis, and Streptoccocus spp. Yeast hosts suitable for production of VLPs of the invention 
include Saccharomyces cerevisiae, Candida albicans, Candida maltosa, Hansenula polymorplia, 

30 Kluyveromycesfragilis, Kluyveromyces lactis, Pichia guillerimondii, Pichia pastoris, 

Schizosaccharomyces pombe and Yarrowia lipolytica. Insect cells suitable for production of 
VLPs of the invention (i.e., via baculovirus expression vectors) include Aedes aegypti, 
Autographa califomica, Bombyx mori, Drosophila melanogaster, Spodpterafnigiperda, and 
Trichoplusia ni. 

J5 Viral vectors can be used for the production of particles in eukaryotic cells, such as those 

derived from the pox family of viruses, including vaccinia vims and avian poxvirus. Additional, 
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vaccinia based infection/transfection systems, such as those as described in Tomei et al, J: Virol 
(1993) 67:4017-4026 and Selby etal., J. Gen. Virol. (1993) 74:1103-1113, can also be used to 
generate the VLPs of the invention, Jn this system, cells are first transfected in vitro with a 
vaccinia virus recombinant that encodes the bacteriophage T7 RNA polymerase. This 
polymerase only transcribes templates bearing T7 promoters. Following infection, cells are 
transfected with the DNA of interest, driven by a T7 promoter. The polymerase expressed in the 
cytoplasm from the vaccinia virus recombinant transcribes the transfected DNA into RNA which 
is then translated into protein by the host translation machinery. The method provides for high 
level, transient, cytoplasmic production of large quantities of RNA and its translation products. 

Depending on the expression system and host selected, the VLPs are produced by growing 
host cells transformed by an expression vector under conditions whereby the particle-forming 
polypeptide is expressed and VLPs can be formed. The selection of the appropriate growth 
conditions is within the skill of the art. If the VLPs are formed intracellularly, the cells are then 
disrupted, using chemical, physical or mechanical means, which lyse the cells yet keep the VLPs 
substantially intact. Such methods are known the those of skill in the art and are described in, 
e.g.. Protein Purification Applications: A Practical Approach, (E.L.V. Harris and S. Angal, 
Eds., 1990). 

The particles are then isolated using methods that preserve the integrity thereof, such as by . 
gradient centrifiigation, e.g., cesium chloride (CsCl) and sucrose gradients; and the like (see, e.g., 
Kimbauer et al, J. Virol. (1993) 67:6929-6936), ion exchange chromatography (including anion 
-exchange chromatography such as DMAE and TMAE), hydroxyapatitie chromatography (see 
WO 00/09671), hydrophobic interaction chromatography, gel filtration chromatography and 
other filtration methods such as nanometric filtration and ultrafiltration. Preferably at least one 
anion exchange step is performed during purification, and more preferably at least two anion 
exchange steps are used. 

VLP formulations of the invention may be further processed by methods known in the art 
to disassemble the VXJPs into smaller, protein containing moieties using a high concentration of 
reducing agent, followed by reassembly of the VLPs by either removal of the reducing agent or 
by addition of excess oxidant. The resulting reassembled VLPs may have improved 
homogeneity, stability and immunogenic properties. In addition, further therapeutic or 
prophylactic agents may be formulated into the VLPs upon reassembly. See McCarthy et al, J. 
Virology (1998) 72(1):32-41. See also WO 99/13056 and WO 01/42780. Reducing agents 
suitable for use in VLP disassembly include sulfhydryl reducing agents (such as glutathion, beta 
mercaptoethanol, dithiothreitol, dithioerythritol, cysteine, hydrogen sulfide and mixtures thereof) 
preferably contained in moderate to low ionic strength buffers. Sufficient exposure time of the 
VLPs to the reducing agent will be required to achieve a suitable amount of VLP disassembly. 
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Adjuvants may be added to the VLPs of the invention to enhance the immunogenicity of 
the SARS viral antigens. Antigens suitable for use with VLPs include those described, supra. 
For example, the VLPs of the invention may be adsorbed onto an aluminum adjuvant. 

The VLPs of the invention may formulated to enhance their stability. Additional 
components which may enhance the stability of a VLP formulation include salts, buffers, non- 
ionic surfactants and other stabilizers such as polymeric polyanion stabilizers. See WO ' 
00/45841. 

The ionic strength of a solution comprising VLP particles may be maintained by the 
presence of salts. Ahnost any salt which can contribute to the control of the ionic strength may 
be used. Preferred salts which can be used to adjust ionic strength include physiologically 
acceptable salts such as NaCl. KCl. Na2S04, (NH4hSO,, sodium phosphate and sodium citrate. 
Preferably, the salt component is present in concentrations of from about 0.10 M to 1 M. Very 
high concentrations are not preferred due to the practical limitations of parenteral injection of 
high salt concentrations. Instead, more moderate salt concentrations, such as more physiological 
concentrations of about 0.I5M to about 0.5M with 0.15M-0.32M NaCl are preferred. 

Buffers may also be used to enhance the stability of the VLP formulations of the invention.- 
Preferably, the buffer optimizes the VLP stabiHty while maintaining the pH range so that the 
vaccine formulation will not be irritating to the recipient. Buffers preferably maintain the pH of. 
the vaccine formulation within a range of p/H 5.5-7.0, more preferably 6.0-6.5. Buffers suitable 
for vaccine formulations are known in the art and include, for example, histidine and imidazole. 
Preferably, the concentration of the buffer will range from about 2mM to about 100 mM. more . 
preferably 5 mM to about 20 mM. Phosphate containing buffers are generally not preferred 
when the VLP is adsorbed or otherwise formulated with an aluminum compound. 

Non-ionic surfactants may be used to enchance the stabiUty of the VLP formulations of the : 
invention. Surfactants suitable for use in vaccine formulations are known in the art and include, 
for example, polyoxyethylene sorbital fatty acid esters (Polysorbates) such as Polysorbate 80 
ie.g., TWEEN 80), Polysorbate 20 (e.g.. TWEEN 20), polyoxyethylene alkyl ethers (e.g., Brij 
35, Brij 58), as well as others, including Triton X-100, Triton X-114. NP-40, Span 85 and the 
Pluronic series of non-ionic surfactants (e.g., Pluronic 121). The surfactant is preferably present 
in a concentration of from about 0.0005% to about 0.5% (wl/vol). 

Polymeric polyanion stabilizers may also be used to enchance the stability of the VLP 
formulations of the invention. Suitable polymeric polyanionic stabilizers for use in the invention 
comprise either a single long chain or multiple cross linked chains; either type possessing 
multiple negative charges along the chains when in solution. Examples of suitable polyanionic 
polymers include proteins, polyanions, peptides and polynucelic acids. Specific examples 
include carboxymethyl ceUulose, heparin, polyamino acids (such as poly(Glu). poly(Asp). and 
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Poly (Glu. Phe), oxidized glutathione, polynuceltodies, RNA, DNA and serum albumins. The 
concentration of the polmeric polyanion stabilizers is preferably jfrom about 0.01% to about 
0.5%, particularly about 0.05-0. 1% (by weight). 

G. Passive Immu nization via Antibodies to the SARS Antieetis of the Invention 
5 The invention includes antibodies specific to the S ARS antigens of the invention and 

methods of treatment or prevention of SARS virus related disease by administrating an effective 
amount of SARS antibodies to a mammalian subject Antibodies specific the SARS antigens can 
be produced by one skilled in the art. Preferably, the antibodies are specific to the spike (S) 
protein of the SARS virus. Potent neutralization of the SARS coronavirus using a human 
10 monoclonal anti-spike antibody has been reported (Sui et al. (2004) PNAS USA 101:2536-2541). 
A IgGl form of the monoclonal antibody showed a higher affinity (1.59 nM) than a scFv form 
(32.3 nM). 

The antibodies of the invention are specific and selective to SARS antigens. 
In one embodiment, the antibodies of the invention are generated by administering a SARS 
15 antigen to an animal. The method may also include isolating the antibodies from the animal. 

The antibodies of the invention may be polyclonal or monoclonal antibody preparations, 
monospecific antisera, human antibodies, or may be hybrid or chimeric antibodies, such as 
humanized antibodies, altered antibodies (Fab')2 ftagments, F(ab) firagments, Fv fi-agments, 
single-domain antibodies, dimeric or trimeric antibody fragments or constructs, minibodies, or 
iO functional fi-agments thereof which bind to the antigen in question. 

Antibodies are produced using techniques well known to those of skill in the art and 
disclosed in, for example, US Patent Nos. 4,011,308; 4,722,890; 4,016,043; 3,876,504; 
3,770,380; and 4,372,745. For example, polyclonal antibodies are generated by immunizing a 
suitable animal, such as a mouse, rat, rabbit, sheep, or goat, with an antigen of interest. In order 
15 to enhance inmiunogenicity, the antigen can be linked to a canier prior to inomunization. Such 
carriers are well known to those of ordinary skill in the art. Immunization is generally performed 
by mixing or emulsifying the antigen in saline, preferably in an adjuvant such as Freund*s 
complete adjuvant, and injecting the mixture or emulsion parenterally (generally subcutaneously 
or intramuscularly). The animal is generally boosted 2-6 weeks later with one or more injections 
0 of the antigen in saline, preferably using Freund's incomplete adjuvant. Antibodies may also be 
generated by in vitro immunization, using methods known in the art. Polyclonal antiserum is 
then obtained from the immunized animal. 

Monoclonal antibodies are generally prepared using the method of Kohler & Milstein 
(1975) Nature 256:495-497, or a modification thereof. Typically, a mouse or rat is immunized as 
5 described above. Rabbits may also be used. However, rather than bleeding the animal to extract 
serum, the spleen (and optionally several large lymph nodes) is removed and dissociated into 
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single cells. If desired, the spleen cells may be screened (after removal of non-specificaUy 
adherent cells) by applying a ceU suspension to a plate or well coated with the antigen. B-cells, 
expressing membrane-bound immunoglobulin specific for the antigen, will bind to the plate, and 
are not rinsed away with the rest of the suspension. Resulting B-cells, or all dissociated spleen 
cells, are then induced to fuse with myeloma cells to form hybridomas, and are cultured in a 
selective medium (e.g., hypoxanthine, aminopterin, thymidine medium, "HAT"). The resulting 
hybridomas are plated by limitmg dilution, and are assayed for the production of antibodies 
which bind specifically to the immunizing antigen (and which do not bind to unrelated antigens). 
The selected monoclonal antibody-secreting hybridomas are then cultured either m vitro (e.g., in 
tissue culture botfles or hollow fiber reactors), or in vivo (e.g., as ascites in mice). 

Humanized and chimeric antibodies are also useful in the invention. Hybrid (chimeric) 
antibody molecules are generally discussed in Winter et al. (1991) Nature 349: 293-299 and US 
Patent No. 4,816,567. Humanized antibody molecules are generally discussed in Riechmann et 
al. (1988) Nature 332:323-327; Verhoeyan et al (1988) Science 239:1534-1536; and U;K. 
Patent Publication No. GB 2,276,169, pubUshed 21 September 1994). One approach to 
engineering a humanized antibody involves cloning recombinant DNA containing the promoter, 
leader, and variable-region sequences from a mouse antibody gene and the constant-region exons 
from a human antibody gene to create a mouse-human chimera, a humanized antibody. See 
generally, Kuby. "Immunology, 3"" Edition", W.H. Freeman and Company, New York (1998) at 
page 136. 

Antibody fragments which retain the abiUty to recognize a SARS antigen are also included . 
within the scope of the invention. A number of antibody fragments are known in the art which 
comprise antigen-binding sites capable of exhibiting immunological binding properties of an 
intact antibody molecule. For example, functional antibody fragments can be produced by 
cleaving a constant region, not responsible for antigen binding, from the antibody molecule, 
using e.g., pepsin, to produce F(ab')2 fragments. These fragments will contain two antigen 
binding sites, but lack a portion of the constant region from each of the heavy chains. Similarly, 
if desired. Fab fragments, comprising a single antigen binding site, can be produced, e.g., by 
digestion of polyclonal ormonclonal antibodies with papain. Functional fragments, includnig 
only the variable regions of the heavy and light chains, can also be produced, using standard 
techniques such as recombinant production or preferential proteolytic cleavage of 
immunoglobulin molecules. These fragments are known as Fy. See, e.g. , Inbar et al (1972) 
Proc. Nat. Acad. Sci USA 69:2659-2662; Hochman et al (1976) Biochem 15:2706-2710; and 
Ehrlich etal (1980) Sfoc^iem 19:4091-4096, 

A single-chain Fv ("sFv" or scFv") polypeptide is a covalentiy linked Vh-Vl heterodimer 
which is expressed from a gene fusion including Vn-and Vl- encoding genes linked by a peptide- 

-157- 



wo 2004/092360 PCT/US2004/011710 
encoding Unker. Huston et al. (1988) Proc. Nat. Acad. Set USA 85:5879-5883. A number of 
methods have been described to discern and develop chemical stracutres (Hnkers) for converting 
the naturally aggregated, but chemicaUy separated, light and heavy polypeptide chains from an 
antibody V region into an sFv molecule which will fold into a three dimensional structure 
substantially similar to the structure of an antigen-binding site. See, e.g., US Patent Nos. 
5,091,513; 5,132,405; and 4,946,778. The sFv molecules may be produced using methods 
described in theart. See, e.g., Huston etal. (1988) Pr<7c. Nat. Acad. Sci t/SA 85:5879-5338; US 
Patent Nos. 5,091.513; 5.132,405 and 4,946,778. Design criteria include determining the ' 
appropriate length to span the distance between the C-terminus of one chain and the N-terminus 
of the other, wherein the linker is generally formed from small hydrophilic amino acid residues 
that do not coil or form secondary structures. Such methods have been described in the art. See. 
e.g., US Patent Nos. 5,091,513; 5,132,405 and 4,946,778. Suitable linkers generally comprise ' 
polypeptide chains of altemating sets of glycine and serine residues, and may include glutamic 
acid and lysine residues inserted to enhance solubility. Anti-spike scFv antibodies have been 
reported (Sui aZ. (2004) />ArA5 i75A 101:2536-2541). 

"Mini-antibodies" or "minibodies" will also find use with the present invention. 
Minibodies are sFv polypeptide chains which include oligomerization domains at their C- 
termini, separated from the sFv by a hinge region. Pack et al, (1992) Biochem 31:1579-1584. 
The oligomerization domain comprises self-associating a-helices, e.g., leucine zippers, that can 
be further stabihzed by additional disulfide bonds. The oligomerization domain is designed to be 
compatible with vectorial folding across a membrane, a process thought to facilitate in vivo 
folding of the polypeptide into a functional binding protein. Generally, minibodies are produced^- 
using recombinant methods weU known in the art. See, e.g.. Pack et al, (1992) Biochem 
31 : 1579-1584; Cumber et al. (1992) J. Immunology 149B : 120-126. 

Non-conventional means can also be used to generate and identify the antibodies of the 
invention. For example, a phage display Ubrary can be screened for antibodies which bind to the 
SARS antigens of the invention. See generaUy, Siegel, "Recombinant Monoclonal Antibody 
Technology", Transjus. Clin. Biol. (2002) 9(1): 15-22; Sidhu. "Phage Display in Phannaceutical 
Biotechnology", Curr. Opin. Biotechnol (2000) 11(6):610-616; Sharon, et al., "Recombinant 
Polyclonal Antibody Ubraries". Comb, aietn. High Throughput Screen (2000) 3(3): 185-196; 
and Schmitz et al., "Phage Display: A Molecular Tool for the Generation of Antibodies- 
Review", Placenta, (2000) 21 Suppl A: S 106- 12. 

The antibodies of the invention may also be generated by administering the polynucleotide 
sequence encoding for the SARS antigen into an animal. The SARS antigen is then expressed in . 
vivo, and antibodies specific to the SARS antigen are generated in vivo. Methods for 
polynucleotide delivery of the SARS antigens of the invention are discussed in section 4 below. 
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The antibodies of the invention are preferably specific to the SARS virus. 

a Combinatio„^ of one or more ofar^ nfih. „h^ e ar,nrn„rh,. l„ 

The compositions of the invention further comprise combinations of one or mote of the 

compositions discussed above. For instance, the invention comprisesacomposition comprising 

an attenuated SARS virus and a subunit SARS viral antigen. 

/. Qombmatim,.. of SAKS anti^e„, mul „tlu..r Ke^,.^,„ „ Vir., A„t,..„. 

-meinventionfurtherrelates to vaccine fonnulations comprising one ormore SARS viri,s 
an^g^s and one or more other respiratory virus antigens. Additional respiratory virus antigens 
suttable for use m the mvention include antigens from influen^a virus, human tWnovirus (HRV) 
paratufluenza virus (PIV). respiratory syncytia, virus (RSV). adenovims. metapneumovirus and 
^tnov^s. The additionalrespiratory Virus antigen could also be ftoma^^ 
the SARS coronavrus, such as the NL63 human cc^virus (van der Hoek e, al. OXmyNamre 
10:368-373). Rreferably.theadditionalrespiratory virus antigen is an i^^^ 



wtfl, the SARS vu:al antigen. Antigens may be used alone or in any combination (See . g 
WO 02A)0249 describing the use of combinations of bacterial antigens). The combinations may 
mc ude multiple antigens from ti,e same pathogen, multiple antigens from different pafltogens o^ 
multiple antigens from tite same and from different pafliogens, TTus. bacterial, viral, and/or 
other antigens may be included in tite same composition or may be administered to fl,e same 
subject separately. It is generally preferred that combinations of antigens be used to raise an 
immune response be used in combinations. 

^.^r'^IT^ """"" °' ^"^^^ »ay be used in fl,e invention include 

d^phthe„a (&e. e.,.. S ot Vaccines, 199Z.^.^i^ii,^,,^^^^^_^^^ 

357.1225-1240). cholen. nierculosis. C. ,eu^. also known as tetanus iSee. e.g.. Chapter 4 of 

1998. eds.Plofldn&Mortimer (ISBN 0-7216-1946-0). GroupAandCioupB 
streptococcus (including S^^.<,c.cc«.„„^^,5.„p,.c<,cc^ 

^ogenes as described, for example, in Watson e, al. (2000) Pediatr. Infect. Dis. J 19-331-332- 
Rub.n«<rf. (2000)P.^,a..a.,.;v.«*^. 47:269-284; Jedrzejas «Z. (2001) Microbiol Mc' 
BK./Rev65:187-207;Schuchat(1999)La«ce,353.'ii «.f^R ,. ">"'"""<"Mol 

Tse 'r Z'^rt " "«'^CH9:23^238). meningitis, Mora^^lU, cau^Ualis 
(See. e.g.. McMtchael (2000) Vaccine 19 Suppl. 1:8101-107) and other pathogenic states. 
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including, without limitation. Neisseria meningitides (A, B, C, Y), Neisseria gonorrhoeae {See, 
e.g., WO 99/24578; WO 99/36544; and WO 99/57280), Helicobacter pylori (e.g., CagA, VacA, 
NAP, HopX, HopY and/or urease as described, for example, WO 93/18150; WO 99/53310; WO 
98/04702) and Haemophilus influenza. Hemophilus influenza type B (HEB) (See, e.g., 

5 Costantino et al. (1999) Vaccine 17: 125 1-1263), Porpkyromonas gingivalis (Ross et al (2001) 
Vaccine 19:4135-4132) and combinations thereof. 

Non-limiting examples of viral pathogens which may be used in the invention include 
meningitis, rhinovirus, influenza (Kawaoka et al.. Virology (1990) 179:759-767; Webster et al., 
"Antigenic variation among type A influenza viruses," p. 127-168. In: P. Palese and D.W. 

10 Kingsbury (ed.). Genetics of influenza viruses. Springer-Verlag, New York), respiratory 

syncytial virus (RSV), parainfluenza virus (PIV), rotavirus {e.g., VPl, VP2, VP3, VP4, "VP6, 
VP7, NSPl, NSP2, NSP3, NSP4 or NSP5 and other rotavirus antigens, for example as described 
in WO 00/26380) and the like. Antigens derived from other viruses will also find use in the 
present invention, such as without limitation, proteins from members of the families 

15 Picomaviridae {e.g., polioviruses, etc. as described, for example, in Sutter et al. (2000) Pediatr 
Clin North Am 47:287-308; Zimmerman & Spann (1999) Am Fani Physician 59:1 13-1 18; 125- 
126); Caliciviridae; Togaviridae {e.g., rubella virus, etc.); Haviviridae, including the genera 
flavivirus {e.g., yellow fever virus, Japanese encephalitis virus, serotypes of Dengue virus, tick 
borne encephalitis virus. West Nile virus, St. Louis encephalitis virus); pestivirus {e.g., classical 

10 porcine fever virus, bovine viral diarrhea virus, border disease virus); and hepacivirus {e.g.. 
hepatitis A, B and C as described, for example, in US Patent Nos. 4,702,909; 5,011,915; 
5,698,390; 6,027,729; and 6,297,048); Parvovirus {e.g., parvovirus B19); Coronaviridae; 
Reoviridae; Bimaviridae; Rhabodoviridae {e.g., rabies virus, etc. as described for example in 
Dressen et al. (1997) Vaccine 15 Suppl:s2-6; MMWR Morb Mortal Wkly Rep. 1998 Jan 

15 16:47(1): 12, 19); Filoviridae; Paramyxoviridae {e.g., mumps virus, measles virus, respiratory 
syncytial virus, etc. as described in Chapters 9 to 11 of Vaccines, 1998, eds. Plotkin & Mortimer 
(ISBN 0-7216-1946-0); Orthomyxoviridae {e.g., influenza virus types A, B and C, etc. sis 
described in Chapter 19 of Vaccines, 1998, eds. Plotkin & Mortimer aSBN 0-7216-1946-0)..); 
Bunyaviridae; Arenaviridae; Retroviradae {e.g., HTLV-1; HTLV-11; HIV-l (also known as 

50 HTLV-m, LAV, ARV, HTI,R, etc.)), including but not limited to antigens from the isolates 

HlVnib, HIVSF2, fflVLAV, HIVI-AL, I-IIVMN, SF162); HIV- I CM235, HFV- 1 US4; HIV-2; 
simian immunodeficiency virus (SIV) among others. Additionally, antigens may also be derived 
from human papilloma virus (HPV) and the tick-bome encephalitis viruses. See, e.g. Virology, 
3"* Edition (W.K. Joklik ed. 1988); Fundamental Virology, 2nd Edition (B.N. Fields and D.M. 

\5 Knipe, eds, 1991), for a description of these and other viruses. 
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Proteins may also be derived from the herpesvirus family, including proteins derived from 
herpes simplex virus (HSV) types 1 and 2, such as HSV-1 and HSV-2 glycoproteins gB, gD and 
gH; antigens derived from varicella zoster virus (VZV), Epstein-Barr vims (EBV) and 
cytomegalovirus (CMV) including CMV gB and gH (See, US Patent No. 4,689,225 and PCX 
Publication WO 89/07143); and antigens derived from other human herpesviruses such as HHV6 
and HHV7. (See, e.g. Chee et al. Cytomegaloviruses (J.K. McDougaU, ed., Springer-Verlag 
1990) pp. 125-169, for a review of the protein coding content of cytomegaloviras; McGeoch.ef 
al, J. Gen. Virol. (1988) 69:1531-1574, for a discussion of the various HSV-1 encoded proteins; 
US Patent No. 5,171,568 for a discussion of HSV-1 and HSV-2 gB and gD proteins and the 
genes encoding therefor; Baer et al. Nature (1984) 310:207-211, for the identification of protein 
coding sequences in an EBV genome; and Davison and Scott, J. Gen. Virol (1986) 67:1759- 
1816, for a review of VZV). Herpes simplex virus (HSV) rgD2 is a recombinant protein 
produced in genetically engineered Chinese hamster ovary cells. This protein has the normal 
anchor region truncated, resulting in a glycosylated protein secreted into tissue culture medium. 
15 The gD2 can be purified in the CHO medium to greater than 90% purity. Human 

immunodeficiency vims (HIV) env-2-3 is a recombinant form of the HIV enveloped protein 
• produced in geneticaUy engineered Saccharomyces cerevisae. This protein represents the entire 
protein region of HIV gpl20 but is non-glycosylated and denatured as purified from the yeast. 
mV gpl20 is a frilly glycosylated, sebreted form of gpl20 produced in CHO cells in a fashion 
iO similar to the gD2 above. Additional HSV antigens suitable for use in immunogenic 

compositions are described in PCT Publications WO 85/04587 and WO 88/02634, the disclosures '■■ 
of which are incorporated herein by reference in their entirety. Mixtures of gB and gD antigens, 
which are truncated surface antigens lacking the anchor regions, are particularly preferred. 

Antigens from the hepatitis family of viruses, including hepatitis A virus (HAV) {See. e.g., 
\5 Bell et al (2000) Pediatr Infect Dis. J. 19:1187-1 188; Iwarson (1995) APMIS 103:321-326), 
hepatitis B vurus (HBV) {See, e.g., Gerlich et ai. (1990) Vaccine 8 Suppl:S63-68 & 79-80), 
hepatitis C virus (HCV) {See, e.g., PCT/US88/04125, published European appUcation number 
318216), the delta hepatitis virus (HDV). hepatitis E virus (HEV) and hepatitis G virus (HGV), 
can also be conveniently used in the techniques described herein. By way of example, the viral 
0 genomic sequence of HCV is known, as are methods for obtaining the sequence. See. e.g. , 

International Publication Nos. WO 89/04669; WO 90/11089; and WO 90/14436. Also included 
in the invention are molecular variants of such polypeptides, for example as described in 
PCT/US99/31245; PCT/US99/31273 and PCT/US99/31272. The HCV genome encodes several 
viral proteins, including El (also known as E) and E2 (also known as E2msi) and an N-terminal 
5 nucleocapsid protein (termed "core") (see, Houghton et al, Hepatology (1991) 14:381-388, for a 
discussion of HCV proteins, including El and E2). Similarly, the sequence for the 5-antigen 
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from HDV is known (see. e.g., US Patent No. 5,378,814) and this antigen can also be 
conveniently used in the present composition and methods. Additionally, antigens derived from 
HB V. such as the core antigen, the surface antigen, SAg, as well as the presurface sequences, 
pre-Sl and pre-S2 (formerly called pre-S), as well as combinations of the above, such as 
; SAg/pre-Sl. SAs/pre-S2, SAg/pre-Sl/pre-S2, and pre-Sl/pre.S2. will find use herein. See, e.g., 
"HBV Vaccines - from the laboratory to Hcense: a case study" in Mackett. M. and Williamson, ' 
J.D., Human Vaccines and Vaccutation, pp. 159-176, for a discussion of HBV structure- and US 

PatentNos. 4,722,840. 5,098.704,5,324,513,incorporated herein by reference in their eLtk^^^ 
Beames et al, J. Virol (1995) 69:6833-6838, Bimbaum et al, J. Virol. (1990) 64-3319-3330- 
and Zhou et al, J. Virol (1991) 65:5457-5464. Each of these proteins, as well as antigenic ' 
fragments thereof, will find use in the present composition and methods. 

Influenza virus is another example of a virus for which the present invention will be 
particularly useful. Specifically, the envelope glycoproteins HA and NA of influenza A are of 
particular interest for generating an immune response. Numerous HA subtypes of influenza A 
have been identified (Kawaoka et al. Virology (1990) 179:759-767; Webster et al, "Antigenic 
variation amongtype A influenza viruses." p. 127-168. In: P. Palese and D.W. Kingsbury (ed ) 
Genetics of influenzaviruses. Springer-Verlag, New York). Thus, proteins derived from any of 
these isolates can also be used in the compositions and methods described herein. 

Non-limiting examples of parasitic antigens include those derived from organisms causing . 

malaria and Lymie disease. 

The methods of the invention comprise administering an hnmunogenic composition 

comprising a SARS viral antigen (including one or more of an inactivated S ARS virus, an . 

attenuated SARS virus, a split SARS virus preparation or a recombinant or purified subunit 

formulation of one or more SARS viral antigens) to an animal. The immunogenic compositions 

used m the invention can comprise an immunologically effective amount of the SARS viral 

antigen. An "immunologically effective amount" is an amount sufficient to allow tiie mammal • 

to raise an immune response to the SARS antigen. 

The immune response preferably involves the production of antibodies specific to the 

SARS antigen. The amount of antibodies produced wiU vary depending on several factors 

including tiie animal used, the presence of an adjuvant, etc. 

The hnmunogenic compositions of the invention may fiirther comprise one or more 

adjuvants. 

The immunogenic compositions of the invention may be administered mucosally. Suitable 
routes of mucosal administration include oral, inti-anasal, intiragastric, pulmonary, intestinal, 
rectal, ocular and vaginal routes. The immunogenic composition may be adapted for mucosal 
administration. For instance, where the composition is for oral administration, it may be in the 
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foim of tablets or capsules. optionaUy enteric-coated, liquid, transgenic plants, etc. Where the 
composition is for intranasal administration, it may be in the form of a nasal spray, nasal drops, 
gel or powder. 

The immunogenic compositions of the invention may be administered parenteraUy. 
Suitable routes of parenteral administration include intramuscular (IM), subcutaneous, 
intravenous, intraperitoneal, intrademial. transcutaneous, and transdennal {see e.g., fotemational 
patent application WO 98/20734) routes, as weU as delivery to the interstitial space of a tissue. 
The immunogenic composition may be adapted for parenteral administration, for instance in the 
form of an injectable that may be sterile and pyrogen fiee. 

Vaccines of the invention may be administered in conjunction with other 
inmiunoregulatory agents. In particular, compositions will usually include an adjuvant. Prefeired 
further adjuvants include, but are not limited to. one or more of the following set forth below: 

A. Mineral Containing Compositions 

Mineral containing compositions suitable for use as adjuvants in the invention include 
mineral salts, such as aluminium salts, and calcium salts. The invention includes mineral salts 
such as hydroxides {e.g. oxyhydroxides), phosphates {e.g. hydroxyphoshpates. orthophosphates), 
sulphates, etc. \e.g. see chapters 8 & 9 of Vaccine design:the subunU and adjuvant approach 
(1995) PoweU & Newman. ISBN 0-306-44867-X.), or mixtures of different mineral compounds, 
with the compounds taking any suitable form {e.g. gel, crystaUine, amorphous, etc.\ and with' 
adsorption being preferred. The mineral containing compositions may also be formulated as a 
particle of metal salt. See WOOO/23105. 

B. Oil-Emulsions 

Oil-emulsion compositions suitable for use as adjuvants in the invention include squalene- 
water emulsions, such as MF59 (5% Squalene, 0.5% Tween 80, and 0.5% Span 85, formulated 
into submicron particles using a microfluidizer). See WO90/14837. See also, Frey et al. , 
"Comparison of the safety, tolerability, and immunogenicity of a MF59-adjuvanted influenza 
vaccine and a non-adjuvanted influenza vaccine in non-elderiy adults". Vaccine (2003) 21-4234- 
4237. ~ 

Particularly preferred adjuvants for use in the compositions are submicron oil-inwater 
emulsions. Preferred submicron oil-in-water emulsions for use herein are squalene/water 
emulsions optionally containing varying amounts of MTP-PE, such as a submicron oil-in-water 
emulsion containing 4-5% w/v squalene, 0.25-1.0% w/v Tween 80 ™ (polyoxyelthylenesorbitan 
monooleate), and/or 0.25-1.0% Span 85™ (sorbitan trioleate), and, optionally. N-acetylmuramyl- 
L-alanyl-D-isogluatminyl-L-alanine-2-(r-2'-dipalmitoyl-^n-gIycero-3- 
huydroxyphosphophoryloxy)-ethylamine (MTP-PE), for example, the submicron oil-in-water 
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emulsion known as "MFSQ" (Intemational Publication No. WO 90/14837; US Patent Nos. 
6,299,884 and 6,451,325, incorporated herein by reference in their entireties; and Ott et al., 
"MF59 - Design and Evaluation of a Safe and Potent Adjuvant for Human Vaccines" in Vaccine 
Design: Tlie Subunit and Adjuvant Approach (PoweU. M.R and Newman, M.J. eds.) Plenum 
Press, New York. 1995, pp. 277-296). MF59 contains 4-5% w/v Squalene {e.g., 4.3%), 0.25- 
0.5% w/v Tween 80™, and 0.5% w/v Span 85™ and optionaUy contains Various amounts of 
MTP-PE, formulated into submicron particles using a microfluidizer such as Model HOY 
miciofluidizer (Microfluidics, Newton, MA). For example, MTP-PE may be present in an 
amount of about 0-500 /xg/dose, more preferably 0-250 jig/dose and most preferably, 0-100 
M-g/dose. As used herein, the term "MF59-0" refers to the above submicron oil-in-water 
emulsion lacking MTP-PE, while the term MF59-MTP denotes a formulation that contains MTP- 
PE. For instance. "MF59-100" contains 100 Hg MTP-PE per dose, and so on. MF69, another 
submicron oil-in-water emulsion for use herein, contains 4.3% w/v squalene, 0.25% w/v Tween 
so™, and 0.75% w/v Span 85™ and optionally MTP-PE. Yet another submicron oil-in-water 
emulsion is MF75, also known as SAF, containing 10% squalene, 0.4% Tween 80™, 5% 
pluronic-blocked polymer L121, and thr-MDP, also microfluidized into a submicron emulsion. 
MF75-MTP denotes an MF75 formulation that includes MTP, such as from 100-400 ng MTP-PE 
per dose. 

Submicron oil-in-water emulsions, methods of making the same and immunostimulating 
agents, such as liiuramyl peptides, for use in the compositions, are described in detail in 
Intemational Publication No. WO 90114837 and US Patent Nos. 6.299,884 and 6,45 1,325, 
incorporated herein by reference in their entireties. 

Complete Freund's adjuvant (CFA) and incomplete Freund's adjuvant (IFA) may also be 
used as adjuvants in the invention. 
C. Saponin Formulations 

Saponin formulations, may also be used as adjuvants in the invention. Saponins are a 
heterologous group of sterol glycosides and triterpenoid glycosides that are found in the bark, 
leaves, stems, roots and even flowers of a wide range of plant species. Saponin from the bark of 
the Quillaia saponaria MoUna tree have been widely studied as adjuvants. Saponin can also be 
commerciaUy obtained from SmUax omata (sarsapriUa), Gypsophilla paniculata (brides veil), 
and Saponaria officianalis (soap root). Saponin adjuvant formulations include purified 
formulations, such as QS21, as well as lipid formulations, such as ISCOMs. 

Saponin compositions have been purified using High Performance Thin Layer 
Chromatography (HP-LC) and Reversed Phase High Performance Liquid Chromatography (RP- 
HPLC). Specific purified fractions using these techniques have been identified, including QS7, 
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QS17.QS18,QS21.QH-A.QH-BandQH-C. Preferably, the saponin is QS21. A method of 
production of QS21 is disclosed in US Patent No. 5,057.540. Saponin formulations may also 
comprise a sterol, such as cholesterol (see WO 96/33739). 

Combinations of saponins and cholesterols can be used to fonn unique particles called 
Immunostimulating Complexs (ISCOMs). ISCOMs typically also include a phospholipid such 
as phosphatidylethanolamine or phosphatidylcholine. Any known saponin can be used in 
ISCOMs. Preferably, the ISCOM includes one or more ofQuil A. QHA and QHC. ISCOMs are 
further described in EP 0 109 942, WO 96/11711 and WO 96/33739. Optionally, the ISCOMS 
may be devoid of additional detergent. See WOOO/07621. 

A review of the development of saponin based adjuvants can be found at Baix, et cd 
'ISCOMs and other saponin based adjuvants". Advanced Drug Delivery Reviews (1998) ' 
32:247-271. See also Sjolander. et al, "Uptake and adjuvant activity of orally delivered saponin 
and ISCOM vaccines". Advanced Drug Delivery Reviews (1998) 32:321-338. 
D. Bacteri al or Microbial Derivfltivpg 

Adjuvants suitable for use in the invention include bacterial or microbial derivatives such 

as: 

(1) Non-toxic derivatives of enterobacterial lipopolysaccliaride (LPS) 

Such derivatives include Monophosphoryl lipid A (MPL) and 3-O-deacylated MPL 
(3dMPL). 3dMPL is a mixture of 3 De-O-acylated monophosphoryl lipid A with 4 5 or 6 
acylated chains. A preferred "small particle" form of 3 De-O-acylated monophosphoryl Hpid A 
is disclosed in EP 0 689 454. Such "small particles" of 3dMPL are small enough to be sterile 
filtered through a 0.22 micron membrane (see EP 0 689 454). Other non-toxic LPS derivatives 
include monophosphoryl lipid A mimics, such as aminoalkyl glucosaminide phosphate ■ 
derivatives e.g. RC-529. See Johnson et al. (1999) Bioorg Med Chem Lett 9:2273-2278. 

(2) Lipid A Derivatives 

lipid A derivatives include derivatives of Upid A from Escherichia coli such as OM-174 
OM-174 is described for example in Meraldi et al, "OM-174. a New Adjuvant with a Potential 
for Human Use. Induces a Protective Response with Administered with the Synthetic C- 
Teiminal Fragment 242-310 from the circumsporozoite protein of Plasmodium berghei" 
Vaccine (2003) 21:2485-2491; and Pajak. et al, "The Adjuvant OM-174 induces both the 
migration and maturation of murine dendritic cells in vivo". Vaccine (2003) 21:836-842. 

(3) Immunostimulatory oligonucleotides 

Immunosamulatory oligonudeoUdes suitable for use as adjuvants m the invention include 
nucleotide sequences containing a CpG motif (a sequence containing an umnethylated cytosine 
followed by guanosine and linked by a phosphate bond). Bacterial double stranded RNA or 
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oligonucleotides containing palindromic or poly(dG) sequences have also been shown to be 
immunostimulatory. 

The CpG's can include nucleotide modifications/analogs such as phosphorothioate 
modifications and can be double-stranded or single-stranded. Optionally, the guanosine may be 
replaced with an analog such as 2'-deoxy-7-deazaguanosine. See Kandimalla, et al, "Divergent 
synthetic nucleotide motif recognition pattern: design and development of potent 
immunomodulatory oligodeoxyribonucleotide agents with distinct cytokine induction profiles". 
Nucleic Acids Research (2003) 31(9): 2393-2400; WO 02/26757 and WO 99/62923 for 
examples of possible analog substitutions. The adjuvant effect of CpG oligonucleotides is fiirther 
discussed in Kneg, "CpG motifs: the active ingredient in bacterial extracts?". Nature Medicine 
(2003) 9(7): 831-835; McCluskie, etal, "Parenteral and mucosal prime-boost immunization 
strategies in mice with hepatitis B surface antigen and CpG DNA", FEMS Immunology and 
Medical Microbiology (2002) 32:179-185; WO 98/40100; US Patent No. 6,207,646; US Patent 
No. 6,239,116 and US Patent No. 6,429,199. 

The CpG sequence may be directed to TLR9, such as the motif GTCGTT or TTCGTT. See 
Kandimalla, et al, "Toll-like receptor 9: modulation of recognition and cytokine induction by 
novel synthetic CpG DNAs", Biochemical Society Transactions (2003) 31 (part 3): 654-^658. The 
CpG sequence may be specific for inducing a Thl immune response, such as a CpG-A ODl^, or 
it may be more specific for inducing a B cell response, such a CpG-B ODN. CpG-A and CpG-B 
ODNs are discussed in Blackwell, et al, "CpG-A-Induced Monocyte IFN-gamma-Inducible 
Protein-10 Production is Regulated by Plasmacytoid Dendritic CeU Derived IFN-alpha", J. 
Immunol. (2003) 170(8):4061-4068; Krieg, "From A to Z on CpG", TRENDS in Immunology 
(2002) 23(2): 64-65 and WO 01/95935. Preferably, tiie CpG is a CpG-A ODN. 

Preferably, the CpG oligonucleotide is constructed so that the 5' end is accessible for 
receptor recognition. Optionally, two CpG oligonucleotide sequences may be attached at their 3' 
ends to form "immunomers". See, for example, Kandimalla, et al, "Secondary structures in CpG 
oligonucleotides affect immunostimulatory activity", BBRC (2003) 306 :948-953: Kandimalla, et 
al., "Toll-like receptor 9: modulation of recognition and cytokine induction by novel synthetic 
GpG DNAs", Biochemical Society Transactions (2003) 31(part 3):664-658; Bhagat et al, "CpG 
penta- and hexadeoxyribonucleotides as potent inununomodulatory agents" BBRC (2003) 
300:853-861 and WO 03/035836. 

(4) ADP-ribosylating toxins and detoxified derivatives thereof. 

Bacterial ADP-ribosylating toxins and detoxified derivatives thereof, may be used as 
adjuvants in the invention. Preferably, the protein is derived from E. coli (le., E. coli heat labile 
enterotoxin "LT), cholera ("CT"), or pertussis ("PT"). The use of detoxified ADP-ribosylating 
toxins as mucosal adjuvants is described in WO 95/1721 1 and as parenteral adjuvants in WO 
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98/42375. Preferably, theadjnvantisadMoxifi«lLTmutantsuchasLT-K63 LT-R72 and 
L^jr ADP-ribosyla«„g toxins and deto«fied deriva«es hereof, partijularly 

LT-K63 andLT-R72. as adjuvants can befoundin fte foUowingrefe.«„oes. each of which is 
speoficallyincon^^tedbyreferencehereininfteirentoty: Bdgnon. "Trhe LTO72 
5 Muten, Of Haat-Labile Enterotoxin of Escherichia coli Enahnces fte AbiHty of Peptide Antigens 

toEhc.tCD4.TCeUs and Secrete Gan™aL,terferonafterCoappBca.iono„.oBls,o^^^^ 
L>fect,on and Immunity (2002) 20(6):3012-3019; Hzza. 'Mucosal vaccines: non-toxic 
denvatives of LT and CT as mucosal adjuvants". Vaccine (2001) 19:2534-2541 • Pizza. 'e,al. ' 
•LTK63 and LTR72. two mucosal adjuvants ready for cUnical trials" Iht J. Med Mic.„biol" 
. «(.5):45^,Scharton-K^.en.....r.3cutaneous^^^^^ 

™' ""^'"'^ Adjuvants". Infecdon and Immunity 

a0O0m^):53O6-5313:Rya„«^...M„.^,„,H,,,,,,^,^„„^^^^^^^^^^ 

f^=7™**--'-A<'j-antsfarNasalDeUveryofanAcellularPertussis Vaccine: Differential 

Effects of the Nontoxic AB qomplex and Enzyme Activity on Thl and 71,2 Cells" Infection and 

—ty(1999)^(12):6270-«280;Partid„s«... labile ente^^^^^^ 

and .ts s. e-dnccted mutant LTK63 enhance the proliferative and cytotoxic T-cell responses to 

mtranasa^lyco-immunizedsyn.heticpeptides".ImmnnoLUt..(1999)ffl(3):2M^^ 

Mutantsofd,eEschericMacoUheat.labiIeen,erotoxir.assafeandsttongadjuv.^^^^^ .. 
mtranasal dehvery of vaccines", Vaccines (2003) 2(2):285-293: and Pine e. al (2002) 
••Inti:anasal immunization wifl, influenza vaccine and a detoxified mutant of belt labile . 
enterotoxin from Escherichia coli (LTK63r J. Control Release (2002) 85(l-3)-263-270 
Nl^calreferen^for amino acid substitutions is preferably based onta 
(m5)Trn«,1?!f '''■'^''"''"""""^ 

(1995)15(6):1165-1167,specificaUyincoq»n.tedhereinbyreferenceinitsentirety. 

E. Human Immunnmodulators 

Human immunomodulators suitable for use as adjuvants in a,e invention include cytokines 
suchasmter>euk,ns(.,.II.l.iL.2.n^.IL.5.n^,j^, jj^j, ^ 
mterferon-T). macrophage colony stimulating factor, and tamor necmsis factor. 

F. Bioadhesives and Mucoadhpsivpg 

Bioadhesives and mucoadhesives may also be used as adjuvants in U» invention. Suitable 
b,odhes.vesmc,udeesteri^^ 

70.267-276) or mucoadhesives such as cross-Unked derivatives of poIy(ac,yHc acid), polyvinyl 
alcohol, polyvmyl pyrolUdone. polysaccharides and carboxymethylcellulose. Chi.os» 
denvatives thereof may also be used as adjuvants in the invention. E.g., WO99/27960 
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G. MicroDarticlRs 

Microparticles may also be used as adjuvants in the invention. Microparticles (i.e. a 
particle of -lOOnm to ~150Atm in diameter, more preferably ~200nm to -dOfim in diameter and 
most preferably ~500mn to ~10/im in diameter) formed from materials that are biodegradable 
; and non-toxic (e.g, a poly(a-hydroxy acid), a polyhydroxybutyric acid, a polyorthoester, a 
polyanhydride, a polycaprolactone, etc.), with polydactide-co-glycolide) are preferred. ' 
optionally treated to have a negatively-charged surface (e.g. with SDS) or a positively^harged 
surface (e.^. with a cationic detergent, such as CTAB). 

H. Liposomes 

Examples of liposome formulations suitable for use as adjuvants are described in US 
Patent No. 6,090,406, US Patent No. 5,916,588, and EP 0 626 169. 

I. PolyoxYethyl ene eth er a nd PolvoxvethvlPne Ester Fnrmnlati^^c 

Adjuvants suitable for use in the invention include polyoxyethylene ethers and 
polyoxyethylene esters. W099/52549. Such formulations further include polyoxyethylene 
sorbitan ester surfactants in combmation with an octoxynol (WOOl/21207) as weU as 
polyoxyethylene alkyl ethers or ester surfactants in combination with at least one additional non- • 
ionic surfactant such as an octoxynol (WOOl/21 152). 

Preferred polyoxyethylene ethers are selected from the following group: polyoxyethylene- 
9-lauryl ether Oaureth 9), polyoxyethylene-9-steoryl ether, polyoxytheylene-8-steoryl ether, 
polyoxyethylene-4-lauryl ether, polyoxyethylene-35-lauryl ether, and polyoxyethylene-23-Iauryl 
ether. 

J. PolvDhos nhazene (PCPP) 

PCPP formulations are described, for example, in Andrianov et al, "Preparation of 
hydrogel microspheres by coacervation of aqueous polyphophazene solutions", Biomaterials 
(1998) 19(1-3):109-1 15 and Payne et al., "Protein Release from Polyphosphazene Matrices", 
Adv. Drug. Delivery Review (1998) 31(3): 185-196. 
K. Muram vl pep tides 

Examples of muramyl peptides suitable for use as adjuvants in the invention include N- 
acetyl-muramyl-L-threonyl-D-isoglutamine (thr-MDP). N-acetyl-noniiuramyl-L-alanyl-D- 
isoglutamine (nor-MDP). and N-acetylmuramyl-L-alanyI-D.isoglutaminyl-Lialanine-2-(r-2'- 
dipalmitoyl-j/i-glycero-3-hydroxyphosphoryloxy)-ethylamineMTP-PE). 
L. Imida zoQuinolone Comp oiinds 

Examples of imidazoquinolone compounds suitable for use adjuvants in the invention 
include Imiquamod and its homologues, described further in Stanley. "Lniquimod and the 
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imidazoquinolones: mechanism of action and therapeutic potential" Clin Exp Dermatol (2002) 
ZLOy-^H-Sll and Jones, "Resiquimod 3M", Curr Opin Investig Drugs (2003) 4(2):214-218. 
M. Viro somes and Virus Like Particles rVT^Ps't 

Virosomes and Virus Like Particles (VLPs) can also be used as adjuvants in the invention. 
These shiicfajres generally contain one or more proteins j6om a virus optionally combined or 
formulated with a phosphoUpid. They are generally non-pathogenic, non-replicating and 
generally do not contain any of the native viral genome. The vkal proteins may be recombinantiy 
produced or isolated from whole viruses. These viral proteins suitable for use in virosomes or 
VLPs include proteins derived from influenza virus (such as HA or NA), Hepatitis B virus (such 
as core or capsid proteins). Hepatitis E virus, measles vims, Sindbis virus. Rotavirus, Foot-and- 
Mouth Disease virus, Retirovirus, Norwalk virus, human Papilloma virus, HEV, RNA-phages, 
Q6-phage (such as coat proteins), GA-phage, fr-phage, AP205 phage, and Ty (such as 
retirotiransposon Ty protein pi). VLPs are discussed further in WO 03/024480, WO 03/024481, 
and Niikura et al., "Chimeric Recombinant Hepatitis E. Virus-Like Particles as an Oral Vaccine 
Vehicle Presenting Foreign Epitopes", Virology (2002) 293:273-280; Lenz et al., 
"PapiUomarivurs-Like Particles Induce Acute Activation of Dendritic Cells", Journal of 
Immunology (2001) 5246-5355; Pinto, et a}., "Cellular Immune Responses to Human 
Papillomavirus (HPV)-16 LI Healtiiy Volunteers Immunized witii Recombinant HPV-16 LI 
Virus-Like Particles", Journal of Infectious Diseases (2003) 188:327-338; and Gerber et al, 
••Human Papillomavrisu Virus-Like Particles Are Efficient Oral Immunogens- when 
Coadministered with Escherichia coli Heat-Labile Entertoxin Mutant R192G or CpG", Journal 
of Virology (2001) 75(10):4752.4760. Virosomes are discussed further in, for example. Gluck 
et al., "New Technology Platforms in tiie Development of Vaccines for tiie Future", Vaccine 
(2002)20:B10-B16. 

The invention may also comprise combinations of aspects of one or more of tiie adjuvants 
identified above. For example, tiie following adjuvant compositions may be used in tiie 
invention: 

(1) a saponin and an oil-in-water emulsion (W099/11241); 

(2) a saponin {e.g.., QS21) + a non-toxic LPS derivative {e.g., 3dMPL) (see WO 
94/00153); 

(3) a saponin {e.g.., QS21) + a non-toxic LPS derivative (e.g., 3dMPL) + a cholesterol; 

(4) a saponin {e.g. QS21) + 3dMPL + IL-12 (optionally + a sterol) (W098/57659); 

(5) combinations of 3dMPL witii. for example. QS21 and/or oil-in-water emulsions (See 
European patent appHcations 0835318, 0735898 and 0761231); 
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(6) SAF, containing 10% Squalane, 0.4% Tween 80, 5% pluronic-block polymer L121, 
and thr-MDP, either microfluidized into a submicron emulsion or vortexed to generate a larger 
particle size emulsion. 

(7) Ribi™ adjuvant system (RAS), (Ribi Immunochem) containing 2% Squalene, 0.2% 
Tween 80, and one or more bacterial cell waU components from the group consisting of 
monophosphoryUpid A (MPL), trehalose dimycolate (TDM), and cell wall skeleton (CWS), 
preferably MPL + CWS (DetoxTM); and 

(8) one or more mineral salts (such as an aluminum salt) + a non-toxic derivative of LPS 
(such as 3dPML). 

Aluminium salts and MF59 are preferred adjuvants for parenteral inomunisation. Mutant 
bacterial toxins are preferred mucosal adjuvants. 

As mentioned above, adjuvants suitable for use in the invention may also include one or 
more of the following: 

- E.coli heat-labile enterotoxin ("LT"). or detoxijSed mutants thereof, such as the K63 or 
R72 mutants; 

- cholera toxin ("CT"), or detoxified mutants thereof; 

. - microparticles (i.e., a particle of -lOOnm to -ISOjwm in diameter, more preferably 
~200nm to -SO^tm in diameter, and most preferably ~500nm to ~10/im in diameter) formed from 
materials that are biodegradable and non-toxic (e.g. a poly(a-hydroxy acid), a 
polyhydroxybutyric acid, a polyorthoester. a polyanhydride. a polycaprolactone etc.y, 

- a polyoxyethylene ether or a polyoxyethylene ester (see International patent application 
WO 99/52549); 

-a polyoxyethylene sorbitan ester surfactant in combination with an octoxynol (see 
International patent application WO 01/21207) or a polyoxyethylene all^l ether or ester 
surfactant in combination with at least one additional non-ionic surfactant such as an octoxynol 
(see Intemational patent application WO 01/21152); 

- chitosan (e.g. Intemational patent application WO 99/27960) 

- an immunostimulatory oligonucleotide (e.g. a CpG oligonucleotide) and a saponin (see 
Intemational patent application WO 00/62800) 

- immunostimulatory double stranded RNA. 

- aluminum compounds (e.g. aluminum hydroxide, aluminum phosphate, aluminum 
hydroxyphosphate, oxyhydroxide, orthophosphate, sulfate etc. {e.g. see chapters 8 & 9 of 
Vaccine design: tlie subunit and adjuvant aproach, eds. Powell & Newman, Plenum Press 1995 
(ISBN 0-306-44867-X) (hereinafter "Vaccine design"), or mixtures of different aluminum 
compounds, with the compounds taking any suitable form (e.g. gel, crystalline, amorphous etc.), 
and with adsorption being preferred; 
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- MF59 (5% Squalene, 0.5% Tween 80, and 0.5% Span 85. fonnulated into submicron 
particles using a rmcrofluidizer) (see Chapter 10 of Vaccine design; see also International patent 
application WO 90/14837); 

- liposomes (see Chapters 13 and 14 of Vaccine design); 

- ISCOMs (see Chapter 23 of Vaccine design); 

- SAP, containing 10% Squalane, 0.4% Tween 80, 5% pluroMc-block polymer L121, and 
thr-MDP, either microfluidized into a submicron emulsion or vortexed to generate a larger' 
particle size emulsion (see Chapter 12 of Vaccine design); 

- RibiTM adjuvant system (RAS). (Ribi Immunochem) containing 2% Squalene, 0.2% 
Tween 80, and one or more bacterial cell wall components from the group consisting of 
monophosphorylipid A (MPL), trehalose dimycolate (TDM), and cell waU skeleton (CWS) 
preferably MPL + CWS (Detox™); 

- saponin adjuvants, such as QuilA or QS21 (see Chapter 22 of Vaccine design), also 
known as Stimulon™; 

- ISCOMs, which may be devoid of additional detergent (WO 00/07621); 

- complete Freund's adjuvant (CFA) and incomplete Freund's adjuvant (WA); 

- cytokines, such as interleukins (e.g. IL-1, IL-2, IL-4, IL-5, IL-6, IL-7, IL.I2' etc ) 
interferons (e.g. interferon-y), macrophage colony stimulating factor, tumor necrosis factor, etc 
(see Chapters 27 & 28 of Vaccine design); 

- monophosphoryl lipid A (MPL) or 3-O-deacylated MPL (3dMPL) (e.g. chapter 21 of ' • 
Vaccine design); 

- combinations of 3dMPL with, for example, QS21 and/or oil-in-water emulsions 
(European patent applications 0835318, 0735898 and 0761231); 

- oligonucleotides comprising CpG motifs (see Krieg (2000) Vaccine, 19:618-622; Krieg 
(2001) Curr. Opin. Mol. Ther., 2001, 3:15-24; WO 96/02555, WO 98/16247. WO 98/18810 WO 
98/40100, WO 98/55495. WO 98/37919 and WO 98/52581. etc.) Le. containing at least one CG 
dinucleotide, 

- a polyoxyethylene ether or a polyoxyethylene ester (Intemational patent application 
W099/52549); 

- a polyoxyethylene sorbitan ester surfactant in combination with an octoxynol 
(Intemational patent application WO 01/21207) or a polyoxyethylene alkyl ether or ester 
surfactant in combination with at least one additional non-ionic surfactant such as an octoxynol 
(WO 01/21152); 

- an immunostimulatory oUgonucleotide (e.g. a CpG oligonucleotide) and a saponin 
(WOOO/62800); ^ 
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- an immunostimulant and a particle of metal salt (Intemational patent application 
WOOO/23105); 

- a saponin and an oil-in-water emulsion (WO 99/1 1241); 

- a saponin (e.g. QS21) + 3dMPL + IL-12 (optionally + a sterol) (WO 98/57659). 
Other adjuvants suitable for mucosal or parenteral administration are also available {e.g. 

see chapter 7 of Vaccine design: the subunit and adjuvant aproach, eds. Powell & Newman, 
Plenum Press 1995 (ISBN 0-306-44867-X). 

Mutants of LT are preferred mucosal adjuvants, in particular the "K63" and '•R72" mutants 
ie.g. see Intemational patent application WO 98/18928), as these result in an enhanced immune 
response. 

Microparticles are also preferred mucosal adjuvants. These are preferably derived from a 
poly(a-hydroxy acid), in particular, from a poly(lactide) ("PLA"), a copolymer of D,L-lactide 
and glycohde or glycolic acid, such as a poly(D,L-lactide-co-glycolide) ("PLG" or "PLGA"), or 
a copolymer of D,L-lactide and caprolactone. The microparticles may be derived from any of 
various polymeric starting materials which have a variety of molecular weights and. in. the case 
of the copolymers such as PLG, a variety of lactide:glycolide ratios, the selection of which will 
be largely a matter of choice, depending in part on the coadministered antigen. 

The SARS virus (inactivated or attenuated), viral antigens, antibodies or adjuvants of the 
invention may be entrapped within the microparticles, or may be adsorbed to them. Entrapment 
within PLG microparticles is preferred. PLG microparticles are discussed in further detail in 
Morris et al, (1994), Vaccine, 12:5-11, in chapter 13 of Mucosal Vaccines, eds. Kiyono et al.. 
Academic Press 1996 gSBN 012410587), and in chapters 16 & 18 of Vaccine design: • tlie 
subunit and adjuvant aproach, eds. Powell & Newman. Plenum Press 1995 (ISBN 0-306-44867- 
X). 

LT mutants may advantageously be used in combination with microparticle-entrapped 
antigen, resulting in significantly enhanced immune responses. 

Aluminium compounds and MF59 are preferred adjuvants for parenteral use. 
The composition may include an antibiotic. 

The inmiunogenic compositions of the invention may be administered in a single dose, or 
as part of an administration regune. The regime may include priming and boosting doses, which 
may be administered mucosally, parenterally, or various combinations thereof. 

The metiiods of the invention further comprise tireating or preventing a SARS virus-related 
disease by administering to an animal a composition comprising an effective amount of die 
antibodies of the invention. An "effective amount" of the antibodies of the invention is an 
amount sufficient to provide passive inamunization protection or tireatment to the animal. 
Preferably, the antibodies of the invention are specific to the SARS viral antigen. 
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Methods of treatment may combine both immunogenic compositions and antibody 
compositions. Accordingly the invention comprises a method for treating or preventing a SARS 
vims-related disease comprising administering an immunogenic composition comprising an 
immunologically effective amount of a SARS viral antigen and administering an effective 
amount of antibodies specific to SARS viral antigen. The immunogenic composition and the 
antibodies may be administered together or separately. The invention further comprises a 
composition comprising an immunogenic composition comprising an immunologically effective 
amount of a SARS viral antigen and fiirther comprising an effective amount of antibodies 
specific to a SARS viral antigen. 

The SARS viral antigens and antibodies of the invention may also be administered in 
polynucleotide form. The SARS viral antigens and/or antibody proteins are then expressed in 
vivo. 

The SARS viral antigens and the antibodies of the invention can also be delivered using 
one or more gene vectors, administered via nucleic acid immunization or the like using- standard 
gene delivery. protocols. Methods for gene delivery are known in the art. See, e.g., US Patent 
Nos. 5,399,346, 5,580,859, 5,589,466. The constructs can be delivered {e.g., injected) either 
subcutaneously, epidermally, intradennally, intramuscularly, intravenous, mucosally (such as 
nasally, rectaUy and vaginally), intraperitoneaUy, orally or combinations thereof. Intramuscular 
injection of 25/*g plasmid DNA encoding spike antigens, in 200/til PBS pH 7.4, at weeks 0, 3 and . 
6, has been described for mice by Yang et al. (2004) Nature 428:561-564. 

An exemplary replication-deficient gene delivery vehicle that may be used in the practice 
of the present invention is any of the alphavirus vectors, described in, for example, US Patent 
Nos. 6,342,372; 6,329,201 and International Publication WO 01/92552. 

A number of viral based systems have been developed for gene transfer into mammalian 
cells. For example, retroviruses provide a convenient platform for gene delivery systems. 
Selected sequences can be inserted into a vector and packaged in retroviral particles using 
techniques known in the art. The recombinant virus can then be isolated and delivered to cells of 
the subject either in vivo or ex vivo. A number of retroviral systems have been described (US 
Patent No. 5,219,740; Miller & Rosman, BioTechniques (1989) 7:980-990; Miller, A.D., Human 
Gene Tlierapy (1990) 1:5-14; Scarpa et al.. Virology (1991) 180:849-852; Bums et al, Proc. 
Natl Acad. Sci. USA (1993) 90:8033-8037; and Boris-Lawrie & Temin, Cur. Opin. Genet. 
Develop. (1993) 3:102-109. 

A number of adenovirus vectors have also been described. Unlike retroviruses which 
integrate into the host genome, adenoviruses persist extrachromosomally thus minimizing the 
risks associated with insertional mutagenesis (Haj-Ahmad and Graham, /. Virol. (1986) 57:267- 
274; Bett et al, J. Virol (1993) 67:591 1-5921; Mitteieder et al. Human Gene Tlierapy (1994) 
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5:717-729; Seth et al, J. Virol (1994) 68:933-940; Barr et al.. Gene Therapy (1994) 1:51-58; 
Berkner, K.L. BioTechniques (1988) 6:616-629; and Rich et al.. Human Gene Therapy (1993) 
4:461-476). Adenoviral delivery of codon-optimsed versions of the genes encoding SARS 
coronavirus structural antigens spike SI, membrane protein and nucleocapsid protein has been 
5 investigated in rhesus macaques and found to invoke a strong neutralizing antibody response 
(Gao et al. (2003) Lancet 362(9399): 1895-1 896). 

Additionally, various adeno-associated virus (AAV) vector systems have been developed 
for gene delivery. AAV vectors can be readily constructed using techniques well known in the 
art. See, e.g., US Patent Nos. 5,173,414 and 5,139,941; International Publication Nos. WO 

10 92/01070 (published 23 January 1992) and WO 93/03769 (published 4 March 1993); Lebkowski 
et al., Molec. Cell. Biol (1988) 8:3988-3996; Vincent: al. Vaccines 90 (1990) (Cold Spring 
Harbor Laboratory Press); Carter, B.J. Current Opinion in Biotechnology (1992) 3:533-539; 
Muzyczka, N. Current Topics in Microbiol and Immunol (1992) 158:97-129; Kotin, R.M. 
Human Gene Therapy (1994) 5:793-801; Shelling and Smith, Gene Therapy (1994) 1:165-169; 

15 and Zhou era/., 7. Exp. Med (1994) 179:1867-1875. : 

Another vector system useful for delivering polynucleotides, mucosally and otherwise, is 
the enterically administered recombinant poxvirus vaccines described by Small, Jr., P.A., et al. 
(US Patent No. 5,676,950, issued October 14, 1997, herein incorporated by reference) as well as 
the vaccinia virus and avian poxviruses. By way of example, vaccinia virus recombinants 

10 expressing the genes can be constructed as follows. The DNA encoding the SARS antigen or 
antibody or antibody coding sequence is first inserted into an appropriate vector so that it is 
adjacent to a vaccinia promoter and flanking vaccinia DNA sequences, such as the sequence 
encoding thymidine kinase (TK). This vector is then used to transfect cells that are 
simultaneously infected with vaccinia. Homologous recombination serves to insert the vaccinia 

15 promoter plus the gene encoding the coding sequences of interest into the viral genome. The 
resulting TK recombinant can be selected by culturing the cells in the presence of 5- 
bromodeoxyuridine and picking viral plaques resistant thereto. 

Alternatively, avipoxviruses, such as the fowlpox and canarypox viruses, can also be used 
to deliver genes encoding the SARS viral antigens or antibodies of the invention. Recombinant 

•0 avipox viruses, expressing inmiunogens from mammalian pathogens, are known to confer 
protective immunity when administered to non-avian species. The use of an avipox vector is 
particularly desirable in human and other mammalian species since members of the avipox genus 
can only productively replicate in susceptible avian species and therefore are not infective in 
mammalian cells. Methods for producing recombinant avipoxviruses are known in the art and 

5 employ genetic recombination, as described above with respect to the production of vaccinia 
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Viruses. See, e.g., WO 91/12882; WO 89/03429; and WO 92/03545. Picomavirus-derived 
vectors can also be used. (See, e.g., US Patent Nos. 5,614,413 and 6,063,384). 

Molecular conjugate vectors, such as the adenovirus chimeric vectors described in Michael 
et al, J. Biol Chem. (1993) 268:6866-6869 and Wagner et al, Proc. Natl Acad. ScL USA 
(1992) 89:6099-6103, can also be used for gene delivery. 

A vaccinia based infection/transfection system can be conveniently used to provide for 
inducible, transient expression of the coding sequences of interest (for example, a SARS yiral 
antigen or antibody expression cassette) in a host cell. In this system, cells are first infected in 
vitro with a vaccinia virus recombinant that encodes the bacteriophage T7 RNA polymerase. 
This polymerase displays exquisite specificity in that it only transcribes templates bearing T7 
promoters. Following infection, cells are transfected with the polynucleotide of interest, driven 
by a T7 promoter. The polymerase expressed in the cytoplasm from the vaccinia virus 
recombinant tiranscribes the ti:ansfected DNA into RNA that is then translated into protein by Oie - . 
host translational machinery. The method provides for high level, teansient, cytoplasmic 
production of large quantities of RNA and its tiranslation products. See, e.g., Elroy-Stein and 
Moss, Proc. Natl Acad. Sci. USA (1990) 87:6743-6747; Fuerst et al., Proc. Natl Acad: ScL USA 
(1986) 83:8122-8126. 

As an alternative approach to infection with vaccinia or avipox virus recombinants, or to 
tiie deUvery of genes using other viral vectors, an amplification system can be used that will lead 
to high level expression following introduction into host cells. Specifically, a T7 RNA 
polymerase promoter preceding the coding region for T7 RNA polymerase can be engineered. 
Translation of RNA derived firom this template will generate T7 RNA polymerase tiiat in ttim 
will tiranscribe more template. Concomitantly, there will be a cDNA whose expression is under 
tiie contirol of the T7 promoter. Thus, some of the T7 RNA polymerase generated firom 
tiranslation of the amplification template RNA will lead to transcription of tiie desired gene. 
Because some T7 RNA polymerase is required to initiate tiie ampUfication. T7 RNA polymerase 
can be intix)duced into cells along witii tiie template(s) to prime tiie ti^scription reaction. The 
polymerase can be intix)duced as a protein or on a plasmid encoding the RNA polymerase. For a 
fijrther discussion of T7 systems and tiieir use for ti-ansforming cells, see, e.g.. International 
Publication No. WO 94/26911; Studier and Moffatt, J. Mol Biol (1986) 189:113-130; Deng and 
Wolff, Gene (1994) 143:245-249; Gao et al, Biochem. Biophys. Res. Commmu (1994) 
200:1201-1206; Gao and Huang, Nuc. Acids Res. (1993) 21:2867-2872; Chen et al, Nuc. Acids 
Res. (1994) 22:21 14-2120; and US Patent No. 5,135,855. 

The immunogenic compositions of the invention may further comprise diluents, such as 
water, saline, glycerol, etiianol, etc. Additionally, auxiliary substances, such as wetting or 



-175- 



wo 2004/092360 PCT/US2004/011710 

emulsifying agents, pH buffering substances, and the like may be included in the immunogenic 
composition. 

The immunogenic compositions used in the invention can be administered to an animal. 
Animals suitable for use in the methods of the invention include humans and other primates, 
5 including non-human primates such as chimpanzees, and other apes and monkey species; farm 
animals such as cattle, sheep, pigs, goats and horses, domestic animals such as dogs and cats; 
laboratory animals including rodents such as mice, rats and guinea pigs; birds, including 
domestic, wild and game birds such as chickens, turkeys and other gallinaceous birds, ducks, 
geese and the like. Animals suitable for use in die invention can be of any age, including both 
10 adult and newborn. Transgenic animals can also be used in the invention. 

The inununogenic compositions of the invention can be used to treat or prevent SARS 
virus-related diseases. 

The compositions of the invention are preferably pharmaceutically acceptable and 
pharmacologically acceptable. In particularly, the compositions are preferably not biologically 
5 or otherwise undesirable, i.e. , the material may be administered to an individual in a formulation • 
or composition without causing any undesirable biological effects or interacting in a deleterious 
manner with any of the components of the composition in which it is contained. 

Pharmaceutically acceptable salts can also be used in compositions of the invention, for 
example, mineral salts such as hydrochlorides, hydrobromides, phosphates, or sulfates, as well as 
0 salts of organic acids such as acetates, pioprionates, malonates, or benzoates. Especially useful 
protein substrates are serum albumins, keyhole limpet hemocyanin, immunoglobulin molecules, . 
thyroglobulin, ovalbumin; tetanus toxoid, and otiier proteins well known to those of skill in the 
art. Compositions of the invention can also contain liquids or excipients, such as water, saline, 
glycerol, dexti-ose, ethanol, or the like, singly or in combination, as well as substances such as 
5 wetting agents, emulsifying agents, or pH buffering agents. Liposomes can also be used as a 
carrier for a composition of the invention. 

SARS specific reagents and analytical assays may be used in the manufacture and testing 
of the vaccines of the invention. Such analytical assays include, for example: 1) virus titration 
and plaque assays for quantitation of infectious virus particles, 2) a neutiralization assay with 
) constant virus and varying serum dilutions, 3) a two step RT-PCR system OLight Cycler-Roche) 
for detection of negative stirand viral RNA, with die target sequence located within the N gene, 
providing highest possible sensitivity, and 4) EUSA and western blot assays for detection and 
qualification of viral proteins. 

In addition, rabbit polyclonal antiserum has been generated to obtain antibody reagents 
! (and demonsti-ate induction of neutiralizing antibodies) against the SARS-CoV. A sample 

protocol for generating such reagents is set forth below. The virus is first cultivated in suitable 
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ceU culture, such as Vero cells, and pelleted through a 20% sucrose (w/v) cushion. The pellet is 
then subjected to a glycerol potassium-tartmte gradient for further purification. The virus- 
contaming fraction is then diluted and pelleted by ultracentrifugation. The pellet is then 
dissolved in PBS and the virus is inactivated with C3H4O2 (beta-propiolactone, BPL) Two 
. rabbits are innnunized subcutaneously (SC) on day 0, 14, and 28 with 1x10^ inactivated viral 
particles mixed with IFAas adjuvant. Rabbits are bled on days 0 (pre-inoculation). 13, 28 and 
35 (1 week after 3rd immunization). Sera obtained from this protocol were tested for their' 

reactivityagainstSARS-CoVproteinsinwestemblotsandfoundtoreactwiththemajor ' 
structural proteins spike (S), membrane (M). and nucleocapsid (N). 

coronav irus vacairwx 

The SARS epidemic has lead to increased awareness of vital infections caused by 
cotonavixuses. The vaccines of the invention may be adapted to prevent or treat emerging strains 
ot coronaviius, mcluding emerging strains of "SARS virus. 

The invention provides a vaccine compdsing an inactivated (or killed) human coronavirus 

an attenuated human coronavirus, a split human coronavirus preparation, or arecombinant or " 
purrfied subunit formulation of one or more antigens fiom a hm^n coronavirus, wherein the 
human coronavirus is not the SARS coronavirus. Optiomdly, the human coronavirus is not the 
229E coronavirus. Optionally, the human coronavirus is not fl,e OC43 coronavirus. OptionaUy 

tt»hun«ncoronavirusisnottheNI^3coronavirus.Thustheinventionprovidesavaccineas ' - 
defined above, wherein the human coronavirus is not the SARS coronavirus. is not the 229E 
co«>navin.s, is not the OC43 coronavirus and is not the NL63 coronavirus. Such vaccines a« 
useful for preventing and/or treating emerging human coronavirus infections. 

The invention also provides a vaccine comprising: (a) an inactivated (or MUed) human 
coronavirus, an attenuated human coronavirus, a spUt human comnavirus preparation or a 
recombinant or purified subunit formuladon of one or more antigens from a human coronavirus 
wherem the human coronavnus is not fte SARS coronavirus. as defined above; and (b) an ' 
inactivated (or killed) human coronavirus, an attenuated human co^navirus, a split human 
coronavirus preparation, or a recombmant or purified subunit formulation of one or more 
^gens from a human coronavirus, whe«in the human coronavirus is the SARS coronavirus 
Such vaccines are useful for preventing and/or treating both SARS and other human 
coronaviruses. 



As well as providing vaccines comprising antigens from more than one type of 
co^navims, the invention also provides vaccines comprising antigens from more than one strain 
of the same coronavirus e.g. different stiainsof theSARS coronavirus, or different strains of a 
cotonavm. other than the SARS cotonavirus. m one embodiment, the vaccine comprises 
antigens from at least two sti:ains of coronavirus, or at least three strains of coronavirus m one 
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embodiment, the vaccine comprises antigens from at least two types of coronavims. In one 
embodiment, the vaccine comprises at least one antigen from each of the known types of 
coronaviruses (type I, type H and type IE). Such vaccines follow the model of current influenza 
vaccines. 

The selection of coronaviruses and/or coronavirus strains for use in vaccines of the 
invention can be based on various criteria. For instance, selection may be based on viruses and/or 
strains that have been detected in the geographical region (e.g. northern or southern hemisphere, 
a particular country, etc.) where the vaccine targeted. Selection may be based on the results of 
animal surveillance e.g. of viruses detected in cat populations. Selection may be based on the 
results of clinical surveiUance e.g. of viruses detected in patients hospitalized with respiratory 
infection. Selection may be performed every year e.g. prior to winter. Vaccines may also be 
administered yearly, again following the model of current influenza vaccines. 

Preferred vaccines are sufficiently immunogenic to provide a neutralizing immune 
response, and more preferably a protective and/or therapeutic immune response. Particularly 
preferred vaccines meet the efficacy requirements that may be specified by the WHO from time 
to time. 

A preferred subunit antigen for inclusion in vaccines of the invention is a purified spike 
protem, more preferably in oligomeric (e.g. trimeric) form. The spike protein may or my not be , 
cleayed e.g. into its SI and S2 products. 

The techniques disclosed above for selecting viruses and/or strains for production of 
vaccines can also be used to select appropriate viruses and/or strains from which HRl and HR2 
sequences can be obtained for providing therapeutic peptides, as disclosed above. 

///. DIAGNOSTIC COMPOSITIONS AND METHODS OF THE INVENTION 

The invention provides methods for detecting the S ARS coronavirus. Detection in patient 
samples can be used to detect and diagnose infections by the virus. Detection in donated blood 
can be used to prevent inadvertent transmission of the virus during blood transplant procedures 
Detection methods fall into three main categories: detection of SARS vims nucleic acids; 
detection of SARS virus proteins; and detection of anti-SARS virus immune responses. The 
invention provides all such methods. 

As used herein when referring to nucleotide sequences, particularly oligonucleotide probes 
and primers, "similar" sequences includes those sequences that are at least 90% identical to 
known SARSV genomic sequence and includes sequences that are at least;95 % identical, at 
least 99% identical and 100% identical to the SARSV genomic sequence over the length of the 
probe or primer. 
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To a sluiry of LiAlEU (4 equivalents) in dry THF, cooled to 0 °C under N2, was slowly 
added drop-wise a solution of methyl 4-[4-(l-BOC-piperazin-4-yl)phenoxymethyl]ben2oate (1 
equivalent) in dry THF. Once the addition was complete, the slurry was heated to reflux at 80 
°C for 1 hour. The slurry was subsequently cooled to 0 °C and treated with water, 10% aq. 
NaOH and with water again. The resulting solids were filtered, and the filtrate was diluted with 
chloroform, washed with brine, dried over MgS04 and concentrated, providing the crude 4-[4-(4- 
methylpiperazin-l-yl)phenoxymethyl]benzyl alcohol that was used without purification-; 

To a solution of DMSO (2.6 equivalents) in dry DCM , cooled to -78 °C under N2 was 
added oxalyl chloride (1.1 equivalents) in DCM drop-wise. The solution was stirred at -78 °C 
for 5 minutes before a solution of 4- [4-(4-methylpiperazin-l-yl)phenoxymethyl]benzyl alcohol 
(1 equivalent) in DCM was added drop-wise, and allowed to stir at -78 "C for another 30 
mim^tes. Triethylamine.(2.5 equivalents) was slowly dripped in before allowing the solution to 
reach anibient temperatures. The solution was washed with aqueous NaHCOs and brine, dried 
over MgS04 and concentrated to provide the crude 4-[4-(4-methylpiperazin-l- 
yl)phenoxymethyl]benzaldehyde that was converted to tiiiosemicarbazones according to Scheme 
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PYRROLES 

Scheme 15 
Synthesis of Pyrrole 




Preparation of tert-hutyl (2E)-3-(2,4-dichlorophenyl)prop-2-enoate (2). 

Neat Die (1.4 eq) was added to a well stirred solution of cinnamate (1 eq), t-butyl 
alcohol (4 eq), DMAP (1 .4 eq) and CH2CI2 under argon at rt. (Note - The cinnamate must be 
completely in solution that may require gentle warming. Allow the solution to cool to room 
temperature before adding the DIG. To avoid an exotherm on larger scales, it may be prudent to 
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dilute the DIG with CH2CI2 before the addition and have an ice bath ready.) After stirring for 8 
hours, the reaction develops a white precipitate. The reaction may be monitored by TLC eluting 
with 25% EtOAc/Hexane (Rf of product was 0.9). The entire reaction was loaded into a 
separatory funnel (washing with CH2CI2). The organic mixture was washed with citrate, sat. aq. 
NaHCOa, water, and brine. The organic layer was dried (Na2S04), filtered, and concentrated to 
dryness to giye the crude product as an oil. The crude oil was mixed with hexane and stirred for 
30 min. The precipitate that forms was filtered over celite and the filtrate was evaporated. The 
hexane mixture was loaded onto a filter plug of silica and eluted with EtOAc/hexane (97:2 v/v). 
The first eluted UV active fractions are collected and evaporated to give >99% pure 2 (75-80% 
yields). 




3 



Preparation of tert-hutyl 4-(2,4-dichlorophenyl)pyrrole-3-carboxylate (3). 

Dry ether was added to NaH (1.5 eq as the oil dispersion) under argon. After decanting 
off the ether via syringe, the NaH was suspended again with fresh ether under argon. A solution 
of TOSMIC (1.1 eq) and 2 (1 eq) dissolved in a mixture of ether and DMSO was added dropwise 
to the stirred suspension of NaH at 0 °C over 20-30 min. The addition was mildly exothermic 
and evolved gas. After the addition, the reaction was allowed to warm to ambient rt. The 
progress of the reaction was followed by TLC (25% EtOAc/Hexane, the UV active product was 
at Rf = 0.4) and LCMS until done (-2-3 h). Upon completion, the reaction was carefully 
quenched with sat. aq. NH4CI (added slowly to avoid strong gas evolution and exotherm) and 
diluted with ether. The layers were separated and the organic phase was washed with sat. aq. 
NaHCOs, water, and brine. The crude dark solid can be purified by recrystallization. Best 
results were achieved either through recrystallization directly from a mixture of hot 
EtOAc/hexane (1:3 v/v) or by dissolving the crude product in minimal hot EtOAc followed by 
addition of hexane (~2 volumes of hexane based on the volume of EtOAc). The hot solutions 
were allowed to cool to room temperature and age over night. The crystals were first filtered and 
then washed with hexane giving 99% pure product in 60-70 % yield. 
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cr 



CL 




Br 




H 



NaH. DMF, 
RT. 70% yield 



3 



4 



Preparation of tert-hutyl 4-(2,4-dichlorophenyl)-l-[3-(l^-dioxobeiizo[c]azolin-2- 
yl)propyl]pyrrole-3-carboxylate (4). 

Solid NaH (1.5 eq as the oil dispersion) was added in small portions to a solution of 
pyrrole 3 (1 eq) and S-bromopropyl phthalimide (1.2 eq) dissolved in r>MF stirred at room 
temperature and flushed with argon. NOTE - Some gas evolves, but the temperature does not 
seem to rise above 40-50 °C. The reaction was stirred for 1.5 h at room temperature under argon. 
The reaction was followed by TLC (CH2Cl2/acetonitrile (95:5 v/v), the UV active product was at 
Rf = 0.5) and LCMS. Upon completion, the reaction was quenched witih sat. aq. NH4CI (add 
slowly to avoid strong gas evolution and exotherm). Sat. aq. NaHCOa was then added to avoid 
an emulsion, and the basic organic mixture was extracted with ether. The combined ether layers 
were washed with sat. aq. NaHCOs, water, brine, dried Na2S04, filtered, and concentrated to 
dryness to give the crude product. The crude product was purified by eluting through silica with 
EtOAc/Hexane (1 :4 v/v). The purified product contained some residual 3-bromopropyl 
phthalimide, that did not interfere with subsequent synthetic steps. The material was taken on 
and used without further purification. Assume a quantitative yield- 



Preparation of tert'hutyl l-(3-aniinDpropyl)-4-(2,4-dichlorophenyI)pyrro!e-3-carboxylate 
(5). 



The Pthalimido Pyrrole 4 (1 eq) was dissolved in ethanol and hydrazine (3 eq) at room 
temperature under nitrogen. Upon heating to reflux, the reaction generated a white precipitate. 
Stir at reflux until complete (~2 h) by TLC (CHaCla/acetonitrile (95:5 v/v), the UV active 
product was at Rf = 0.2) and LCMS. Upon reaching completion, the reaction was allowed to 
cool to room temperature and the precipitate was vacuum-filtered off using a medium to fine 
cintered-glass filter. The filtrate was concentrated under reduced pressure to a gummy solid. 




4 



s 
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The crade material was taken up in ethanol/EtOAc (1:1 v/v), stirred and the precipitate was 
filtered off in the same fashion as before. The filtrate was concentrated under reduced pressure 
and than dried in vacuo for 10-15 min. This process of adding ethanol/EtOAc, filtering and 
concentrating was done one more time or as needed to remove the majority of the white 
precipitate and residual hydrazine. The product was then dried in vacuo overnight The material 
was used without further purification. Once dried, the reaction yielded the product as a glass 
(-87% yield over 2 steps). _ - 



10 



15 





IHo Hunig's Base, DMA, 
^ 80 C. 58-60% yield 



Preparation of te/t-butyl l-{3-[(6-aiiimo-5-nitro(2-pyridyI))ainino]propyl}-4-(2 
dichlorophenyI)pyrroIe-3-carboxyIate (7). 

To the premixed dry reagents, pyrrole 5 (1 eq) and powdered 6-chloro-3-nitro-2- 
pyridylamine (6) (1.1 eq), was added the DMA followed by Hunig's base (2 eq) sequentially with 
stirring at rt. The reaction was then hbated to 80 °C overnight. The reaction was foUowed by 
TLC (EtOAc/hexane (1:1 v/v), the UV active yellow product was at Rf = 0.25), HPLC and 
LCMS. Upon completion as judged by HPLC, the reaction was allowed to cool to 70 °C. 
Ethylene diamine (anhydrous) was then added to the reaction to destroy any remaining unreacted 
chloropyridine 6. After 15 min stirring at 70 "C, the reaction was cooled and quenched with the 
addition of sat. aq. NaHCOa. The aqueous mixture was extiracted with EtOAc, and tiie combined 
organic layers were washed with sat. aq. NaHCOs, water, brine, dried, filtered, and concenti-ated 
to dryness to give the crude product as a brown-yellow soUd. The crude product was purified by 
flash chromatography eluted with EtOAc/hexane (4:6 v/v). The purified SnAr adduct 7 was 
isolated in 58% yield as a yellow solid. 



25 




TFA (10% v/v) 

CH2CI2. 
1% water, 
RT, -99% yield 
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Preparation of l-{3-[(6-amino-5-mtro(2-pyridyl))amino]propyl}-4-(2,4- 
dichlorophenyl)pyrrole-3-carboxylic acid (8). 

In a vial, TFA (catalytic amount) was added to a stirred mixture of ^^^r^-butyl ester pyrrole 

7 (1 eq), water (.1%), and CH2CI2 at rt. The vial stirred at room temperature until done (-12 h. 

5 The reaction was then concentrated under reduced pressure at room temperamre and dried in 

vacuo. The crude residue was dissolved again in CH2CI2 and concentrated under reduced 

pressure at rt. The material was used in the final coupling step witiiout further purification as the 

TFA salt. 




CHIR 154703 

10 

Preparation of N-((lS)-2-hydroxy-isopropyI)(l-{3-[(6-aimno-5-iiitro(2- 
pyridyI))amino]propyl}-4-(2,4-dichIorophenyI)p3nrrol-3-yl)carboxamide (9,). 

(2S)-(+)-2-Aminopropan-l-ol (1.5 eq) was added to a stirred mixture of acid (8) (1 eq), 
HBTU (1.5 eq), HUnig's base (2 eq) and DMF (premixed sequentially in this order in a vial) at 
15 room temperature under argon. The reaction was stirred for 3-4 h until complete as shown by 
LGMS and HPLC. The reaction mixture was subsequently diluted with EtOAc, washed with 
NaHCOa, and concentrated to afford a powder in a 70% yield. 

Nomenclature for the Example compounds was provided using ACD Name version 5.07 
software (November 14, 2001) available from Advanced Chemistry Development, Inc. Some of 
20 the compoimds and starting materials were named using standard lUPAC nomenclature. • 

The compounds of Table 34 were synthesized following the synthetic methodology 
described above in the Examples and Schemes, and screened following methods 1 and 2 below. 
The precursors are readily recognizable by one skilled in the art and are commercially available 
from Aldrich (Milwaukee, WI) or Acros Organics (Pittsburgh, PA), among others. 

25 

Screening methods for SMIP/SMIS compounds 

Method 1 

Candidate small molecule immuno-potentiators can be identified in vitro. Compounds 
are screened in vitro for their ability to activate imotnune cells. One marker of such activation is 
30 the induction of cytokine production, for example TNF-a production. Apoptosis inducing small 
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molecules may be identified having this activity. These small molecule immuno-potentiators 
have potential utility as adjuvants and immuno-therapeutics. 

In an assay procedure (High Throughput Screening (HTS)) for small molecule inomune 
potentiators (SMIPs), human peripheral blood mononuclear cells (PBMC), 500,000 per mL in 
RPME 1640 medium with 10% PCS, were distributed in 96 well plates (100,000 per well) 
aheady containing 5fjM of compound in DMSO. The PBMCs were incubated for 18 h at 37°C ir 
5% CO2. Their ability to produce cytokines in response to the small molecule compounds is 
determined using a modified sandwich ELISA. 

Briefly supematants from the PBMC cultures were assayed for secreted TNF using a 
primary plate bound antibody for capture followed by a secondary biotinylated anti-TNF 
antibody forming a sandwich. The biotinylated second antibody was then detected using 
streptavidin-Europium and the amount of bound europium was determined by time resolved 
fluorescence. SMIP compounds were confirmed by their TNF inducing activity that was. 
measured in the assay as increased Europim counts over cells incubated in RPMI medium alone. 
"Hits" were selected based on their TNF-inducing activity relative to an optimal dose of 
Upopolysaccaride UPS (1 ng/nH), a strong TNF inducer. The robustness of the assay and low 
backgrounds allowed for the routine selection of hits with -10% of LPS activity that was 
normally between 5-lOX background (cells alone). Selected hits are then subjected to 
confirmation for their ability to induce cytokines firom multiple donors at decreasing 
concentrations. Those compounds with consistent activity at or below 5piM are considered 
confirmed for the purposes of this assay. The assay is readily modified for screening for 
compounds effective at higher or lower concentrations. 

Method 2 

Each of the compounds in the above Table 34 elicited TNF-a production in human 
peripheral blood mononuclear cells. Many of the compounds showed activity at less than 20 \xM 
with respect to production of TNF-a. Many of these compounds showed activity at less than 5 
liM with respect to production of TNF-a. Many of these compounds showed activity in the 
production of TNF-a at less than 1.5 \iM. 

For this reason, each of the R groups of any of the compounds listed in Table 34_are 
preferred. Additionally, because of the excellent activity of each of the compounds, each of 
these compounds is individually preferred and is preferred as a member of a group that includes 
any or all of the other compounds and each compound is preferred in methods of modulating 
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immunopotentiation and in methods of treating biological conditions associated therewith, for 
example to be used as a vaccine adjuvant. Each of the compounds is also preferred for use in 
preparation of medicaments for vaccines, immunopotentiation, reducing tumor growth and in 
treating biological conditions mediated therefrom. 
5 In additon to the procedure described above, methods of measuring other cytokines (e.g. 

ILl-beta, IL-12, IL-6, IFN-gamma, IL-10 etc.) are well known in the art and can be used to find 
active SMIP compounds of the present invention. - - 

Compounds may be useful that cause production of TNF-a at higher concentrations, such 
as lOOpM, 200 joM or 300|iM in the assays described herein. For example Loxoribine causes 
10 useful production of TNF-a at 300|jM (see Pope et al Immunostimulatory Compound 7-Allyl-8- . 
Oxoguanosine (Loxoribine) Induces a Distinct Subset of Murine Cytokines Cellular 
Immunology 162: 333-339 (1995)). 

The subject invention also includes isotopically-labeled antiviral compounds, that are 

15 structurally identical to those disclosed above, but for the fact that one or more atoms are 

replaced by an atom having an atomic mass or mass number different from the atomic mass or 
mass number usually found in nature. Examples of isotopes that can be incorporated into 
antiviral compounds of the invention include isotopes of hydrogen, carbon, nitrogen, oxygen, 
phosphorous, sulfur, fluorine said chlorine, such as ^H, ^ "C, ^*C, *^N, ^*0, "O, '^S, 

20 and ^^Cl, respectively. Antiviral compounds of the present invention, derivatives thereof, and 

pharmaceuticaliy acceptable salts of said compounds and of said derivatives that contain the 
aforementioned isotopes and/or other isotopes of other atoms are within the scope of this 
invention. Certain isotopically-labeled antiviral compounds of the present invention, for example 
those into which radioactive isotopes such as and ^''C are incorporated, are useful in drug 

25 and/or substrate tissue distribution assays. Tritiated, i.e., ^H, and carbon- 14, /.e., ^'^C, isotopes are 
particularly preferred for their ease of preparation and detectability. Further, substitution with 
heavier isotopes such as deuterium, ^EE, may afford certain therapeutic advantages resulting 
from greater metabolic stability, for example increased in vivo half-life or reduced dosage 
requirements and, hence, may be preferred in some circumstances. Isotopically labeled antiviral 

30 compounds of this invention and derivatives thereof can generally be prepared by carrying out 
known or referenced procedures and by substimting a readily available, isotopically labeled 
reagent for a non-isotopically labeled reagent. 

In accordance with the present invention, methods are provided for the administration of an 
effective amount of a SMIP compound to act as an adjuvant. Also provided are immunogenic 
35 compositions comprising a SMIP compound, an antigen, and optionally other adjuvants. 
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As adjuvants, the SMIP compounds are combined with antigens and deUvery systems to 
foim a final immunogenic composition or vaccine product. 

As immunotherapeutics, the SMIP compounds are used alone or in combination with 
other therapies for treatment of SARS. 

Those of ordinary skill in the art will recognize that physiologicaUy active antiviral 
compounds, SMIPs or SMISs that have accessible hydroxy groups are fiequentiy administered in 
the form of pharmaceuticaUy acceptable esters. The antiviral compounds of this invention can 
be effectively administered as an ester, formed on the hydroxy groups, just as one skiUed in 
pharmaceutical chemistry would expect. It is possible, as has long been known in 
pharmaceutical chemistry, to adjust the rate or duration of action of the antiviral compound by 
appropriate choices of ester groups. 

Other compounds that can be used in combination with the therapeutic agents described 
herein include, pentoxifylline (PTX), methylprednisolone, trunetrexate (Neutrexin), Zadaxin 
(thymosin alpha 1), optionally substituted 5-aminomethinimino-3-methyl-4-isoxazolecarboxylic 
acidphenylamides, cyclosporine A (CsA), 6-oxo-l,4,5-thiadiazin[2,3-i»]quinazoline, 3-amino- 
2(li?)-thioxo-4(3fl)-quinazohnone, gangciclovir, glycyrrhizin, tetracyclines, aminoglycosides, 
quinolones, bicyclam (l,4-Bis(l,4.8,ll-tetraazacyclotetradec-l-ylmethyl)benzene 
octahydrochloride dihydrate), rapamycin, wortmannin, enalapril, roquinimex/Unomide,. 
inactivin, DNCB, AG7088, 9-aminocamptothecin (CPT-11) , loxorobme, bropirimine, Ononase 
® (ranpimase), statins, such as: lovastatin-Mevacor®. pravastatin-Pravachol®, simvastatin- 
Zocor®, fluvastatin-Lescol®, atorvastatin— lipitor® and rosuvastatin-Crestor®. 

As used herein, the term "effective amount" means an amount of antiviral compound of the 
compositions, kits and methods of the present invention that is capable of treating the symptoms 
of the described conditions. The specific dose of a compound administered according to this 
invention will, of course, be determined by the particular circumstances surrounding the case 
including, for example, the compound administered, the route of administration, the state of 
being of the patient, and the severity of the condition being treated. 

The dose of an antiviral compound of this invention to be administered to a subject is 
rather widely variable and subject to the judgment of the attending physician. It should be noted 
that it may be necessary to adjust the dose of a compound when it is administered in the form of 
a salt, such as a laureate, the salt forming moiety of which has an appreciable molecular weight. 

The following dosage amounts and other dosage amounts set forth elsewhere in this 
description are for an average human subject having a weight of about 65 kg to about 70 kg. The 
skilled practitioner will readily be able to determine the dosage amount required for a subject 
whose weight falls outside the 65 kg to 70 kg range, based upon the medical history of the 
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subject and the presence of diseases, e.g., diabetes, in the subject. Calculation of the dosage 
amount for other forms of the free base form such as salts or hydrates is easily accomplished by 
performing a simple ratio relative to the molecular weights of the species involved. 

in general, the pharmaceutical compositions will include at least one antiviral compound 
5 in combination with a pharmaceutically acceptable vehicle, such as saline, buffered saline, 5% 
dextrose in water, borate-buffered saline containing trace metals or the like. Formulations may 
further include one or more excipients, preservatives, solubilizers, buffering agents, lubricants, 
fillers, stabilizers, etc. Methods of formulation are well known in the art and are disclosed, for 
example, in "Remington's Pharmaceutical Sciences," Mack Pub. Co., New Jersey (1991) or 

10 "Remington: The Science and Practice of Pharmacy," 20^ ed., Lippincott Williams & Wilkins, 
Baltimore, Maryland (2000), incorporated herein by reference. 

Pharmaceutical compositions for use within the present invention can be in the form of 
sterile, non-pyrogenic liquid solutions or suspensions, coated capsules, suppositories, lyophilized 
powders, transdermal patches or other forms known in the art. 

15 Many of the active ingredient antiviral compounds are known to be absorbed from the 

alimentary tract, and so it is usually preferred to administer a compound orally for reasons of 
convenience. However, the compounds may equally effectively be administered intravenously, 
subcutaneously, percutaneously, or as suppositories for absorption by the rectum or vagina, if 
desired in a given instance. All of the usual types of compositions may be used, including 

20 tablets, chewable tablets, capsules, solutions, parenteral solutions, troches, suppositories and 
suspensions. Compositions are formulated to contain a daily dose, or a convenient fraction of 
daily dose, in a dosage unit, that may be a single tablet or capsule or convenient volume of a 
liquid. 

Capsules are prepared by mixing the compound or compounds with a suitable diluent and 
25 filling the proper amount of the mixture in capsules. The usual diluents include inert powdered 
substances such as starch of many different kinds, powdered cellulose, especially crystalline and 
microcrystalline cellulose, sugars such as fructose, mannitol and sucrose, grain flours and similar 
edible powders. 

Tablets are prepared by direct compression, by wet granulation, or by dry granulation. 

30 Their formulations usually incorporate diluents, binders, lubricants and disintegrators as well as 
the compound or compounds. Typical diluents include, for example, various types of starch, 
lactose, mannitol, kaolin, calcium phosphate or sulfate, inorganic salts such as sodium chloride - 
and powdered sugar. Powdered cellulose derivatives are also useful. Typical tablet binders are 
substances such as starch, gelatin and sugars such as lactose, fructose, jglucose and the like. 

35 Natural and synthetic gums are also convenient, including acacia, alginates, methylcellulose, 
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polyvinylpyrroUdine and the like. Polyethylene glycol, ethylcellulose and waxes can also serve 
as binders. 

A lubricant is generally necessary in a tablet fonnulation to prevent the tablet and punches 
from sticking in the die. The lubricant is chosen from such slippery solids as talc, magnesium 
and calcium stearate, stearic acid and hydrogenated vegetable oils. 

Tablet disintegrators are substances that svi^ell when wetted to break up the tablet and 
release the compound or compounds. They include starches, clays, celluloses, algms and gums, 
more particularly, com and potato starches, methylcellulose, agar, bentonite, wood cellulose, 
powdered natural sponge, cation-exchange resins, alginic acid, guar gum, citrus pulp and 
carboxymethylcellulose, for example, may be used as well as sodium lauryl sulfate. 

Tablets are often coated with sugar as a flavor and sealant, or with fihn-forming protecting 
agents to modify the dissolution properties of the tablet. The compounds may also be formulated 
as chewable tablets, by using relatively large amounts of pleasant-tasting substances such as 
mannitol in the formulation, as is now well-established in the art. 

When it is desired to administer a compound as a suppository, the typical bases may be 
used. Cocoa butter is a traditional suppository base, that may be modified by addition of waxes 
to raise its melting point slightly. Water-miscible suppository bases comprising, particularly, 
polyethylene glycols of various molecular weights are in wide Use. 

The effect of the compounds may be delayed or prolonged by proper formulation. For 
example, a slowly soluble pellet of the compound may be prepared and incorporated in a tablet 
or capsule. The technique may be improved by making pellets of several different dissolution 
rates and filUng capsules with a mixture of the pellets. Tablets or capsules may be coated with a 
film that resists dissolution for a predictable period of time. Even the parenteral preparations 
may be made long-acting by dissolving or suspending the compound or compounds in oily or 
emulsified vehicles that allow dispersion slowly in the serum. 

The combinations of this invention may be administered in a controlled release formulation 
such as a slow release or a fast release fonnulation. Such controlled release formulations of the 
combination of this invention may be prepared using methods well known to those skilled in the 
art. The method of administration will be determined by the attendant physician or other person 
skilled in the art after an evaluation of the subject's condition and requirements. 

The term "prodrug" means compounds that are transformed in vivo to yield an antiviral 
compound of the present invention. The transformation may occur by various mechanisms, such 
as through hydrolysis in blood. A good discussion of the use of prodrugs is provided by T. 
Higuchi and W. Stella, "Pro-drugs as Novel Delivery Systems," Vol. 14 of the A.C.S. 
Symposium Series, and in Bioreversible Carriers in Drug Design, ed. Edward B. Roche, 
American Pharmaceutical Association and Pergamon Press, 1987. The term, "prodrug" also 
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encompasses mutual prodrags in which one or more antiviral compounds are combined in a 
single molecule that may then undergo transformation to yield the individual antiviral 
compounds of the present invention. 

For example, if an antiviral compound of the present invention contains a carboxylic acid 
functional group, a prodrug can comprise an ester formed by the replacement of the hydrogen 
atom of the acid group with a group such as (Ci-C8)alkyl, (C2-Ci2)alkanoyloxymethyl, 1- 
(alkanoyloxy)ethyl having from 4 to 9 carbon atoms, 1 -methyl- l-(alkanoyloxy)-6thyl having 
from 5 to 10 carbon atoms, alkoxycarbonyloxymethyl having from 3 to 6 carbon atoms, 1- 
(alkoxycarbonyloxy)ethyl having from 4 to 7 carbon atoms, 1-methyl-l- 
(alkoxycarbonyloxy)ethyl having from 5 to 8 carbon atomis, N-(alkoxycarbonyl)aminomethyl 
having from 3 to 9 carbon atoms, 1^0SI-(alkoxycarbonyl)amino)ethyl having from 4 to 10 carbon 
atoms, 3-phthalidyl, 4-crotonolactonyl, gamma-butyrolacton-4-yl, di-N,N-(Ci-C2)alkylamino(C2- 
C3)alkyl (such as P-dimethylaminoethyl), carbamoyl-(Ci-C2)alkyl, N,N-di(Ci- 
C2)alkylcarbamoyl-(Ci-C2)alkyl and piperidino-, pyrrohdino- or morpholino(C2-C3)alkyl. 

Similarly, if an antiviral compound of the present invention comprises an alcohol 
functional group, a prodrug can be formed by the replacement of the hydrogen atom of the 
alcohol group with a group such as (Ci-C6)alkanoyloxymethyl, l-((Ci-C6)alkanoyloxy)ethyl, 1- 
methyl-l-((Ci-C6)alkanoyloxy)ethyI,(Ci-C6)alkoxycarbonyloxymethyl;N-(Ci- 
C6)alkoxycarbonylaminomethyl, succinoyl, (Ci-C6)alkanoyl, a-amino(Gi-C4)alkanoyl, arylacyl 
and a -aminoacyl, or a-aminoacyl-a-aminoacyl, where each a-aminoacyl group is independently 
selected from the naturally occurring L-amino acids, P(0)(0H)2, -P(0)(0(Ci-C6)alkyl)2 or 
glycosyl (the radical resulting from the removal of a hydroxyl group of the hemiacetal form of a 
carbohydrate). 

If an antiviral compound of the present invention comprises an amine functional group, a 
prodrug can be formed by the replacement of a hydrogen atom in the amine group with a group 
such as R^-carbonyl, R^O-carbonyl, NR^R'^'-carbonyl where and R^' are each independently 
((Ci-Cio)alkyl, (C3-C7)cycloalkyl, benzyl, or R^'-carbonyl is a natural a -aminoacyl or natural a- 
aminoacyl-natural a-aminoacyl, -C(0H)C(0)0Y'' wherein Ot^ is H, (Ci-C6)alkyl or benzyl), - 
C(OY^ Y^^ wherein is (C1-C4) alkyl and Y^^ is ((C,-C6)alkyl, carboxy(C,-C6)alkyl, 
amino(C,-C4)alkyl or mono-N- or di-N,N-(Gi-C6)alkylaminoalkyl, -C(Y^) Y'^ wherein Y'" is 
H or methyl and Y^ is mono-N- or di-N,N-(Ci-C6)alkylamino, morpholino, piperidin-l-yl or 
pyrrolidin-l-yl. 

The compositions for use in accordance with the present invention may be formulated in a 
conventional manner using one or more physiologically acceptable carders or excipients. 
Antiviral, SMIP, SMIS, or other immunomodulating compounds are prepared or obtained as 
described herein and in the US Patents and published intemational patent applications listed in 
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Table 1, Table 2, Table 34 and Table 35. The antiviral compounds can be formulated in 
pharmaceutically acceptable compoisitiohs suitable for delivery to the lungs. Particular 
formulations include dry powders, liquid solutions or suspensions suitable for nebulization and 
propellant formulations suitable for use in metered dose inhalers. The preparation of such 
formulations is well know to those skilled in the art, and is described in US Patent Nos. 
5,814,607 and 5,654,007 and in the US Patents and published international patent applications 
listed in Table 3 the disclosures of which are incorporated herein by reference. 

Dry powder formulations will comprise an antiviral compound in a dry, optionally 
lyophilized form with a particle size within a preferred range for deposition within the lung. 
Typically the particle size for deposition in the lung will range between 1 and 5 /im. When 
systemic delivery of the antiviral compound via absorption from the lung into the bloodstream is 
desired the antiviral compound formulation particle size is generally between 0.1 and 2 /xm in 
size; The preferred size range of particles can be produced using methods such as jet-milling, 
spray drying and solvent precipitation, for example. Dry powder devices typically require a 
powder mass in the range from about 1 mg to 100 mg to produce an aerosolized dose. Thus, the 
antiviral compound will typically be combined with a pharmaceutically acceptable dry bulking 
powder. Preferred dry bulking powders include sucrose, lactose, tirehalose, human serum 
albumin (HSA), phospholipids and glycine as well as those disclosed in the documents listed in 
Table 3. Dry powders can be administered to the subject in conventional dry powder inhalers. 
For liquid formulations the antiviral compound can be dissolved in any recognized 
physiologically acceptable carrier for use in delivery of aerosolized formulations. Such carriers 
include buffered and unbuffered aqueous solutions for water soluble compounds, and 
physiological solutions including saline solution (preferably between 0.2 and 2 N NaCl). For 
antiviral compounds with limited solubility, other liquid vehicles such as ethanol, propylene 
glycol and ethanol-propylene combinations may be used. The antiviral compounds may also be 
administered as solids in suspension. t 

For administration by inhalation, the compositions for use according to the present 
invention are conveniently delivered in the form of an aerosol spray administered via pressurized 
packs or a nebulizer, with the use of a propellant, e.g., air, dichlorordifluoromethane, 
dichloroterafluoroethane or other suitable gas. Preferably, for incorporation into the aerosol 
propellant, the antiviral compound formulations of the present invention will be processed into 
respirable particles as described above for the dry powder formulations. The particles are then 
suspended in the propellant, optionally being coated with a surfactant to enhance their 
disbursement. In the use of a pressurized aerosol the dosage unit may be determined by 
providing a valve to deliver a metered amount. 
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Commercially available jet nebulizers are available and may be used to deliver aerosolized 
antiviral compound to a subject. Such jet nebulizers include, but are not limited to, those 
suppUed by AeroTech 11 (CIS-US, Bedford, Mass.). In addition, for delivery of aerosolized 
antiviral compound to the lungs of a subject an oxygen source can be attached to the nebulizer 
providing a flow rate of, for example, 10 IVmin. In general, inhalation is performed over a 5-40 
minute time interval through a mouthpiece during spontaneous respiration. The present 
invention provides for novel compositions comprising a suitable carrier and aerosolized antiviral 
compound in doses sufficient to reduce or ameliorate viral load and SARS symptoms in subjects 
having SARS. Such doses can be lower than corresponding systemic doses that may be used to 
those generally used to reduce or ameliorate viral load and SARS symptoms in subjects having 
SARS. 

The antiviral, SMTP, SMIS, and immunoniodulating compositions of the present invention 
may be adniinistered with a steroidal anti-inflammatory drug for the treatment of SARS and 
SARS symptoms. Examples of steroidal anti-inflammatory drugs of the invention include 
hydrocortisone, prednisolone, dexamethasone, triamcinolone acetonide, fluocinolone acetonide, 
fludrocortisone acetate, betamethasone, etc. 

The antiviral compound composition of the invention is nebulized predominantly into 
particle si^ses allowing a delivery of the drug into the terminal and respiratory bronchioles. For 
efficacious delivery of antiviral compound to the lung endobronchial space of airways in- an 
aerosol, the formation of aerosol particles having mass medium average diameter predominantly 
between 1 to.5 ^tm is necessary. The formulation must additionally provide conditions that 
would not adversely affect the functionality of the airways. Consequently, the formulation must 
contain enough of the drug formulated under the conditions that allow its efficacious delivery 
while avoiding undesirable reaction. 

For liquid solutions and suspensions, the choice of the nebulizer is made from among 
commercially available nebuUzers. The jet nebulizers known as Sidestream O, obtained from 
Medicaid and Pari LCS, LC Plus, and eHow obtained from Pari Respiratory Equipment, 
Richmond, Virginia, are examples of typical nebulizers suitable for the practice of the invention 
Ultrasonic nebulizers that produce appropriate particle sizes of about 1 to 5 /Am such as 
Aerosonic by DeVilbiss and UltraAire by Omron are also suitable. 

Advantageously, the present invention also provides for a kit for use by a consmner for the 
treatment and/or prevention of SARS. Such a kit comprises: (a) a pharmaceutical composition 
comprising a therapeutically effective amount of at least one compound from among those 
described herein or listed in Table 34 and Table 35 or described in the US Patents and pubhshed 
international patent appUcations listed in Table 1, Table 2, and Table 35 and a pharmaceutically 
acceptable carrier, vehicle or diluent; (b) a container for holding the pharmaceutical composition; 
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and, optionally, (c) instructions describing a method of using the pharmaceutical compositions 
for the treatment and or the prevention of SARS. The kit may optionally contain a plurality of 
antiviral compounds for the treatment of SARS wherein the anti viral compounds are selected 
from 3G-like protease inhibitors and papain-liJce protease inhibitors. In a further embodiment, 
the kit contains an antiviral compound which is an RNA-dependent RNA polymerase inhibitor. 
When the kit comprises more than one antiviral compound, the antiviral compounds contained in 
the kit may be optionally combined in the same pharmaceutical composition. 

A "kit" as used in the instant application includes a container for containing the separate 
compositions such as a divided bottle or a divided foil packet. The container can be in any 
conventional shape or form as known in the art that is made of a pharmaceutically acceptable 
material, for example a paper or cardboard box, a glass or plastic bottle or jar, a re-sealable bag 
(for example, to.hold a "refill" of tablets for placement into a different container), or a blister 
pack with individual doses for pressing out of the pack according to a therapeutic schedule. The 
container employed can depend on the exact dosage form involved, for example a conventional 
cardboard box would not generally be used to hold a liquid suspension. It is feasible that more 
than one container can be used together in a single package to market a single dosage form. For 
example, tablets.may be contained in a bottle that is in turn contained within a box. 

An example of such a kit is a so-called blister pack. Blister packs are well known in the 
packaging industry and are being widely used for the packaging of pharmaceutical unit dosage 
forms (tablets, capsules, and the like). Blister packs generally consist of a sheet of relatively stiff 
material covered with a foil of a preferably transparent plastic material. During the packaging 
process, recesses are formed in the plastic foil. The recesses have the size and shape of 
individual tablets or capsules to be packed or may have the size and shape to accommodate 
multiple tablets and/or capsules to be packed. Next, the tablets or capsules are placed in the 
recesses accordingly and the sheet of relatively stiff material is sealed against the plastic foil at 
the face of the foil that is opposite from the direction in which the recesses were formed. As a 
result, the tablets or capsules are individually sealed or collectively sealed, as desired, in the 
recesses between the plastic foil and the sheet. Preferably the strength of the sheet is such that 
the tablets or capsules can be removed from the blister pack by manually applying pressure on 
the recesses whereby an opening is formed in the sheet at the place of the rjecess. The tablet or 
capsule can then be removed via said opening. 

It maybe desirable to provide a written memory aid, where the written memory aid is of the 
type containing information and/or instructions for the physician, pharmacist or subject, e.g., in 
the form of numbers next to the tablets or capsules whereby the numbers correspond with the 
days of the regimen which the tablets or capsules so specified should be ingested or a card that 
contains the same type of information. Another example of such a memory aid is a calendar 
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printed on the card e.g., as follows 'Tirst Week, Monday, Tuesday," . . .etc . ... "Second Week, 
Monday, Tuesday, . . ." etc. Other variations of memory aids will be readily apparent. A "daily 
dose" can be a single tablet or capsule or several tablets or capsules to be taken on a given day. 
Also a daily dose of one or more component(s) of the kit can consist of one tablet or capsule 
5 while a daily dose of another one or more component(s) of the kit can consist of several tablets 
or capsules. 

Another specific embodiment of a kit is a dispenser designed to dispense the daily doses 
one at a time in the order of their intended use. Preferably, the dispenser is equipped with a 
memory-aid, so as to further facilitate compliance with the regimen. An example of such a 
10 memory-aid is a mechanical counter that indicates the number of daily doses that has been 
dispensed. Another example of such a memory-aid is a battery-powered micro-chip memory 
coupled with a liquid crystal readout, or audible reminder signal which, for example, reads out 
the date that the last daily dose has been taken and/or reminds one when the next dose is to be 
taken. 

15 

EXAMPLES 

Example 1-EXAMPLE of a SARS VIRUS ISOLATE 

A SARS virus was isolated from clinical specimens of a patient in Frankfurt, Germany 
(FRA). The isolate was grown in Vero cells. RNA of the SARS virus was extracted and 
20 simplified by RT-PCR. Nucleotide sequence of the viral genome was determined by direct 
sequencing of the PGR product. Computer analysis was used to predict the features of the 
genome, to compare it to previously known coronaviruses and to the sequence of different SARS 
virus isolates. 

More specifically, isolation and sequence was performed as follows. After the third 
25 passage of the SARS virus in Vero cells, viral particles were purified by ultra centrifugation 

from 3x10^ cells supematant. Viral RNA was extracted by Triazol method (Gibco-BRL). Viral 
RNA (200 ng) was transcribed into cDNA with avian RNaseH- thermostable reverse 
transcriptase following the instractions of the manufacturer (ThermoScript RT System, 
Invitrogen). Briefly, either 50 pmoles of oligo (dT)2o (SEQ ID NO: 7389) or 25 ng of random 
30 hexamers were used to prime the RT reaction in a 20 yii final volume. Amplification and 

sequencing of the SARS genome were accomplished by direct sequencing of PCR products 
obtained with: i) specific primers from conserved regions of homology found through multiple 
alignment among known coronaviruses; ii) oligonucleotides designed around short sequences of 
SARS isolates available on the Web through WHO network laboratories; iii) degenerate primers 
35 to amplify the cDNA mixture with multiple overlapping fragments as end products. Gap closure 
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was realized by long distance PGR with high jBdelity Taq (Expand ffigh Fidelity system, Roche) 
using primers designed on selected fragments. Sequence was collected by primer walking using 
a BigDye terminator chemistry (Applied Biosystems) and an automated DNA sequencer (3700 
capillary model. Applied Biosystems). After obtaining a first pass of the entire genome, a set of 
5 both forward and reverse primers were used to amplify and sequence de novo the genome using 
as a template DNA segments of 2 kb on average. Readings from overlapping fragments were 
automatically assembled by AutoAssembler (Applied Biosystems) and the 29,740 bp contiguous 
edited manually. 

Computer analysis of the sequence was performed as follows. The GCG Wisconsin 
10 Package suite (version 10.0) was used for computer analysis of gene and protein sequences. The 
PSORT program ihttp://psort.nibb.ac.jp/) was used for localization predictions. For secondary 
structure analysis, the PHD software available on the Web at 

http://cuhic.hioc.columbia.edu/predictproteiri/ v/2iS applied. The PSI-BLAST algorithm was used for 
homology searches {http://www.ncbi.nlm.nih.gov/blast) using the non-redundant protein database. 

15 ClustalW was appHed to obtain multiple sequence alignments of gene and protein sequences. 
The LeamCoil-VMF program was used to predict coiled-coil regions in the spike proteins 
(http://leamcoil-vinf.lc5.mitedu/cgi-bin/vmf). Leucine zippers were predicted with the program 2ZIP, 
available at http://2Zyj.molgen.nipg.de. 

Phylogenetic analysis was performed using the neighbor-joining algorithm as implemented 

20 in the program NEIGHBOR within the Phylogeny Inference Package (Phylip) (Felsenstein J 
1993, program distributed by the author). Biootstrap analysis was always performed with 100 
replicates using the program Seqboot. Trees were handled and displayed using Tree View. The 
program HMMER was used to generate sequence profiles from multiple sequence alignments of 
the SI domains of spike proteins. Subsequently, the HMMPFAM program was used to compare 

25 the SI domain of SARS spike to the profiles. 

The genome of this SARS virus isolate is 29,740 bases long and the overall structure of the 
genome is similar to that of the three known groups of coronaviruses. Starting from the 5' end a 
leader sequence, an untranslated region (UTR) and two overlapping open reading frames coding 
for one polyprotein containing the enzymes necessary for replication can be identified. They are 

30 foUowed by a region coding for the spike (S), envelope (E), matrix (M), nucleocapsid (N) 

structural proteins and eight additional pRFs specific for the SARS virus. At the 3*-end of the 
genome a UTR with a poly(A) is located. The overall homology to coronaviruses groups 1, 2 
and 3 is low and therefore the SARS virus belongs to a new group (group 4) of coronavirus. 
More detailed analysis of the spike protein amino acid sequence shows that the SARS virus 

35 isolate is more closely related to coronavirus group 2. 
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The complete genome sequence of the SARS viras isolate is 29,740 bp in length. The 
sequence is available on Genbank and has a GC content of 40.8%, comparable with that of 
known viruses of the same family. Genome structure is similar to that of other coronaviruses. 14 
open reading freunes have been predicted. The principal features of the genome and gene 
5 products are illustrated reported in Figure 17 and Table 10. The comparison between the SARS 
genome and those of group 1 , 2 and 3 coronaviruses is reported in Figure 18. 

Nucleotides 1-73 contain a predicted RNA leader sequence followed by an untranslated 
region (UTR) of 197 nucleotides. The UTR is followed by two overlapping open reading frames 
(ORFla, ORFlb), which encompass two-thirds of the genome (nucleotides 265-21485). They 

10 encode for a large polyprotein, which is predicted to be processed by viral proteases to generate 
the replicase complex. The 3' part of the genome contains the genes coding for the four 
structural proteins (S, spike protein, E, envelope protein, M, matrix glycoprotein, and N, ■ 
nucleocapsid protein), and eight predicted ORFs of unknown function (Figure 17). Finally, at the; 
3' end of the genome, we found a second UTR of 340 bases followed by a poly(A) tract. We 

15 identified a putative intergenic (IG) sequence also referred to as transcription-associated 

sequence (TAS), which is a typical feature for coronaviruses. The IG sequence is characterized 
by 6-18 nucleotides present at the 3' end of the leader and can be found in front of each gene. 
The IG sequence plays a key role in RNA transcription and its regulation. The IG sequence of 
the SARS virus is characterized by the sequence SEQ ID NO: 7293 and is present nine times in 

20 the genome (Figure 17). The sequence of the leader and IG are peculiar for eaich coronavirus and 
represent a specific signature for the virus. 
The Replicase Resion 

The replicase gene, ORFlab (SEQ ID NO: 7232), consists of two overlapping ORFs, 
ORFla aiid ORFlb, which can be translated as a single polyprotein by £rame shift of the 

25 ribosome in position 13,393, within the polymerase encoding region. See Brierley et al, Embo J 
1987: 6(12): 3779-3785. As expected, a stem-loop sequence is present ten base pairs 
downstream of this site (SEQ ID NO: 7390; . 

S' -CGGTGTAAGT GCAGCCCG TCTTACACCG -3'). The polyprotein is cleaved co- and/or 
post-translationally into multiple proteins by its own encoded proteases. Using the cleavage 

30 consensus sequence and by analogy with other coronaviruses, we have mapped the possible 
cleavage sites of the polyprotein and have identified 14 products, which comprise the leader 
protein p28, the homologue of the MHV p65 protein and other twelve proteins, named from nspl 
to nspl3 (nsp, non structural protein) (Figure 17 and Table 10). The amino acid sequence 
analysis suggests the presence of several functional motifs within the putative ORFlab proteins. 

35 In particular, we have mapped two potential proteases (nspl and nsp2), one growth factor-like 
motif (nsp7) within ORFla, whereas in ORFlb we identified the RNA polymerase (nsp9), and a 
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predicted helicase (nsplO). The other predicted cleavage products (nsp3, nsp4, nsp5, iisp6, 
nspll, nspl2 and nspl3) are proteins of unknown function. Many of these proteins are 
presumably present in the RNA replication complex, which is associated with the membranous 
structures in the infected cells. In particular, nsp3 and nsp4 contain hydrophobic domains. As 
shown in Figure 18, the replicase region of SARS has a similar organization to group 1, 2 and 3 
coronaviruses; however, the overall aminoacid conservation is low (Table 11). The most 
conserved proteins are the polymerase and the helices. 

Nspl is the papain-like cysteine protease (PUP), which cleaves the first two protein 
products Oeader protein p28 and p65 homologue). Within the nspl of MHV, two domains with 
papam-like protease activity (PLPl and PLP2) have been mapped, (Kanjanahaluethai et al 
(2000) J. Virol 74(17):79 11-21) which are also conserved with Bovine, transmittable 
gastroenteritis virus (TGV) and Human 229E coronaviruses. However, by sequence alignment 
with the SARS nspl, we identified only one PLP domain containing the catalytic residues 
Cys833andHis994. 

Nsp2 is the chymotrypsin-picomavirus 3C-like protease (3CLp), which is responsible for 
the post-translational processing of the other 12 proteins, most of them cleaved at Q/A or Q/S 
sites. (Ziebuhr et cd (1999) /. Virol 73(1): 177-85). It also performs autoproteolytic activity. The 
principal catalytic residues are well conserved with other coronaviruses and are located at 
position His41 and Cysl45. Furthermore, even the conserved aminoacids Tyrl61 and Hisl63, 
which are believed to be involved in substrate recognition and to be indispensable for proteolytic 
activity, (Hegyi et al (2002) 7. Gen Virol 83(Pt3):581-593) were found in the sequence of.the 
SARS 3CLp. 

The invention includes the orf lab sequence of SEQ ID NO: 9960 and the orfla sequence 
of SEQ ID NO: 9961, including fragments, variants, homologs, etc. thereof. 
The Structural Region 

Analysis of the nucleotide sequence at the 3' part of the SARS genome identified 12 
predicted open reading frames. They are coded within 8.2 kb and comprise the four structural 
proteins S, E, M and N, common for all coronaviruses and eight predicted ORFs, which are 
specific for this virus (Figure 18). SARS-specific IG sequences upstream of most ORFs (Figures 
17 & 18) suggest that most genes are likely to be transcribed independently. Interestingly, 
sequences identical to the group 2 IG are also present at the end of the RNA leader and in front 
of the Matrix encoding gene and of ORF 10. 

The spike is a type I glycoprotein, which forms the large spikes on the surface of the virion 
and is responsible for receptor-binding and membrane fusion. (Gallagher (2001) Adv Exp Med 
Biol 494: 183-92). The protein is 1255 residues long with 17 predicted N-glycosylation sites. It 
has a 13aa leader peptide and a 17 aa C-terminal membrane anchoring sequence (1202-1218). 
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Some (MHV, HCoV-OC43, AIBV and BCoV), but not all (TGV, HPV. HCoV-229E) 
coronavirus spike proteins are proteolytically cleaved in two subunits, SI and S2. SI is 
supposed to form the bulbous head, which stays non-covalently linked to the C-terminal 
membrane anchor. Cleavage is mediated by a basic aminoacid sequence, which resembles the 
5 consensus sequence for a furin cleavage site. (Garten et.at, Biochimie 1994; 76(3-4): 217-225). 
However, in case of this SARS virus isolate, we were not able to identify such a sequence, 
implicating that the S protein of this SARS virus isolate is unlikely to be cleaved during 
maturation. Secondary structure predictions indicated that the global architecture of the spike 
protein is conserved within all known coronaviruses. The SI domain is mainly formed by beta 

10 sheets and likely adopts a globular fold, while in the S2 domain extensive alpha helical regions 
are predicted. In addition, the LeamCoil-VMF program, specifically designed to identify coiled- 
coil-like regions in viral membrane-fusion proteins, predicts two coiled-coils within S2, spanning 
aminoacids 900-1005 and 1151-1185, respectively (Figure 19). Both cpiled-coil regions contain 
a leucine-zipper motif, which is also present in the spikes of all coronaviruses. Leucine zippers 

15 are known to promote protein oligomerization; since the spike proteins of TGV and MHV form 
hetero-trimers, (Dehnas et al, J Virol 1990; 64(ll):5367-5375) (Godeke, et al., J Virology 2000; 
74(3): 1566-1571) it is conceivable that in SARS leucine zippers play a role in promoting and/or 
stabilizing a similar quatemary structure. The spike protein plays a major role in the biology of 
coronaviruses because the SI domain contains the receptor-binding doinain and the virus 

20 neutralizing epitopes, while the S2 domain is involved in the process of membrane fusion, which 
is essential for virus infectivity. As expected, multiple sequence alignment of different spike 
proteins showed a major degree of variability within the SI domain, whereas S2 is more 
conserved. 

The envelope protein E is a very short polypeptide of 76 aa, involved in the morphogenesis 
25 of the virion envelope. (Godet et al.. Virology 1992; 188(2):666-675). Computer analysis 

predicts a long transmembrane domain close to the N-terminus and two N-glycosylation sites. 
The level of aminoacid similarity with other coronaviruses is very low; and the best homology is 
with the small envelope protein of the transmissible gastroenteritis virus (TGV). 

The matrix glycoprotein (M) is a 221-residue polypeptide with a predicted molecular 
30 weight of 25 kDa. Computer analysis predicts a topology consisting of a short aminoterminal 
ectodomain, three transmembrane segments and a carboxyl terminus located at the interior side 
of the viral envelope. In analogy with the matrix glycoprotein of TGV, that of the avian infective 
bronchitis virus (AIBV) and that of the hypervirulent MHV-2 strain the SARS M glycoprotein is 
N-glycosylated at the N-terminus. SARS M protein shows highest similarity to group 2 viruses 
35 (Table 11). 
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Finally, the nucleocapsid protein N is a 397-residue-long phosphoprotein that interacts 
witii viral genomic RNA to form the nucleocapsid. The level of conservation with other 
coronaviruses is low, ranging from 26,9% identity with the HCoV-229E to 37,4% identity to the 
Bovine coronavirus (BcoV) (Table 11). Epitope analysis of the nucleocapsid protein has been 
carried out (U et al. (2003) Geno Prot & Bioinfo 1:198-206) in which the epitope site at the C 
terminus of the protein was located as SEQ ID NO: 7394 (amino acids 371-407 of SEQ ID NO: 
6052). 

In addition to the above fundamental proteins, many viruses express a set of other peptides, 
which are generally dispensable for viability, but can influence the infectivity potential of the 
virus, (de Haan et al. Virology 2002; 296(1): 177- 189). These proteins are generally conserved 
within members of the same serogroup, but differ profoundly among the groups. For this reason, 
they are generally referred to as group-specific proteins (Figure 11). Members of the group 1, 
represented here by HcoV-229E, have two group-specific genes located between the S and E 
genes and sometimes one or two ORFs downstream of the N gene, preceding the 3' UTR region 
of the genome. Viruses of the group 2, with MHV as prototype, have two group-specific genes 
(2a and HE) between ORFlb and S, as well as other two between S and E genes. Finally, the 
group 3 viruses, represented by the prototype AIBV, have two group-specific genes between S 
and E and other two between the M and N genes. 

With the exception of the henaagglutinin esterase HE, for which hemagglutinating and 
acetyl-esterase enzymatic activities have been demonstrated, all the other group-specific ORFs 
encode proteins whose role has not yet been established. 

Interestingly, the arrangement of specific genes in the S ARS genome is peculiar and the 
predicted ORFs do not display any significant homology with ORFs present in the other 
coronaviruses, nor with any other known protein from different organisms. Like viruses of the 
group 1 and 3, SARS lacks the HE hemagglutinin and does not contain ORFs between the 
ORFlb and the S gene. Furthermore, two pnrcdicted ORFs (ORF3 and ORF4) are encoded in the 
region between S and E, and superimpose for most of their length. ORF3 has an IG sequence 2 
bp upstream of the ATG start codon. In contrast to the other groups, SARS contains five 
predicted ORFs in the region between M and N genes. ORF7 is located 10 bases downstream of 
the stop codon of M gene, and has an IG sequence 155 nucleotides upstream from the ATG start 
codon. Similarly, ORF8 and ORFIO present an IG right upstream of their ATG start codons. On 
the other hand, the 5' ends of ORF9 and ORFll shortly superimpose with the flanking genes, 
and for this reason they do not need an IG to activate transcription. ORF12 totally superimposes 
with the N gene and shares very low homology with a 22kDa protein of the MHV virus, coded in 
the corresponding region. 
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Despite the absence of indications of possible localization and function deriving from 
sequence similarity, ORF3, ORF7 and ORF8 contain hydrophobic segments, suggesting 
association with membrane structures. In addition, ORF3, the longest among the SARS speciJSc 
gene, is the only one that encodes for a peptide containing a high number of predicted O- 
5 glycosylation sites (Table 1 1). Predicted N-glycosylation sites have been identified in ORF3, 
ORFllandORF12. 

Two shorter OKPs in the non-structural regions are SEQ ID NOS: 9965 and 9966. The 
invention includes polypeptides with these sequences, and also fragments, variants, etc. 
Phylogenetic analysis 

10 The substitution frequency within 922 conserved bases from the pol gene of eleven 

coronaviruses from the three different serogroups has been used in the past to show that the 
variability within members of each serogroup is much smaller than between members of 
different serogroups, confirming the previously described serological groupings. (Stephensen et 
a/., Virus Res 1999; 60(2): 181-9). We used tiie 922 bp region of the pol gene of SARS and 

15 aligned it with the same fragment from other 12 coronaviruses. The tree obtained showed that 
the SARS virus is distinct &om the other three groups of coronaviruses (Figure 20). Similar 
results were obtained using the full-length aminoacid sequences of pol, 3CL-protease and 
helicase from the replicase region and those of the spike and the matrix glycoproteins from the 
structural region (data not shown). These data confirmed that the entire genome of the SARS 

20 virus clusters in a new group (group 4) of coronavirus. 

To gain more resolution for possible evolutionary relationships we performed the analysis 
using consensus sequences of predicted domains of the proteins. In particular, we generated 
consensus sequences of the SI domain of the spike protein from the group 1 and group 2 and 
then we compared them to the SI domain of the SARS spike. No consensus could be generated 

25 from the group 3 since only the spike protein of AIBV is known. Interestingly, the tree 

constructed from the alignment of SARS SI with the consensus generated from the two groups 
of spike proteins was different from that in Figure 20, and showed a much closer relationship 
between SARS and group 2 coronaviruses (Figure 21A). Further analysis showed that 19 out of 
the 20 cysteines present in the SARS SI domain are spatially conserved with the group 2 

30 consensus sequence, while only five are maintained either within the group 1 and group 3 

sequences (Figure 21B). Given the fundamental role played by cysteines in protein folding, it is 
likely that the SI domain of SARS and group 2 coronaviruses share a similar spatial 
organization. 
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Sequence variability between SARS coronavirwip.s 

We compared the FRA sequence to the four complete SARS genomes available on the 
Web. A total of 30 mutations were detected. Nine of these mutations were silent while 21 
resulted in aminoacid substitutions (Table 12). Within ORFla, three silent and seven productive 
mutations were detected. In ORFlb, there were five silent and three productive mutations. One 
of the productive mutations was caused by two nucleotide substitutions resulting in a single 
aminoacid change. Five changes were located in the spike protein, four of these were productive 
and one silent. Two productive mutations were in ORF3 and in the matrix glycoprotein M. One 
productive mutation each was in ORFIO and in the nucleocapsid protein N. 

The overall difference between FRA and TOR2 was of nine nucleotides resulting in two 
silent mutations and seven aminoacid changes. The difference between FHA and Urbani is 12 
nucleotides, which result in five silent mutations and seven aminoacid changes. For CUHK 16 
nucleotides were different, five of which were silent mutations. For FRA and HKU 14 nucleotide 
changes resulted in four silent and nine productive mutations. 

EXAMPLE 2 -Production, Inactivdtion and Purification of Whole SARS Virus Using MCS 
Chromatography Resin Purification FoUowed by Density Gradient Ultracentrifiigatidn 

A SARS isolate FRAl (EMBL: AYS 10120) was passaged on VERO cells that were 
cultivated in DMEM (Gibco: Cat No. 21969-035, Lot No. 3078864), Penicillin/Strep (Gibco: Cat 
No. 15070-063, Lot No. 1 120042), and 3% FCS (Gibco: Cat No. 10270-106, Lot No. 
40F6130K) at 37*>C, 5% CO2. Trypsin (Gibco: Cat No. 25300-054, Lot No. 3078729) was used 
for detaching the ceUs. • 

For virus production the third passage was used for inoculation of VERO cells at a moi of 
-0.1. Cells were incubated with the virus for 1 h at 37°C in infection medium (DMEM without 
PS, FCS); after Ih cells were washed twice and further incubated at 37*'C for 48 h in the presents . 
of 3% FCS and antibiotics. The supernatant was harvested 48 hours post infection (p.i.) and 
precleared by centrifugation at 3000 rpm at 4*C for 10 min. 

The SARS virus was inactivated by p-propiolactone (BPL) treatment (1:2000) for 18 h at 
4'*C, followed by 3 h at 37°C. Testing the virus on successful inactivation, VERO cells were 
incubated with 10 ml BPL treated supematant for 4 days at 37°C; subsequentially, the 
supernatant was to-ansferred to a fresh VERO cell culture and further incubated for another 4 
days. Cells were checked for cytopathic effect (CPE). 

200 ml of the BPL-inactivated SARS virus harvest was then clarified using a 0 65 Atm- 
pore-size filter (47 nmi diameter) to pass virus particles and retain cell debris. The filter unit was 
connected to a Masterflex pump, which accomplished a consistent flow rate of 40 ml/min. 
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A MCS Chromatoeraohv Purification Step 

The filtered virus suspension was then subjected to MCS chromatography. The MCS 
column was prepared as follows. 27 ml slurry led to 14 ml sedimentated resin which was packed 
using a Gotec Superformance Column (diameter 1.0 cm, height 15.7 cm, volume 12.33 ml). 1% 
5 of the column volume of a 1% acetone solution was injected to the colmnn and the colmnn was 
run with a flow of 100 cm/h. The HETP, N and As values were then calculated as HETP: 0,056 
cm, N / m: 1790 and As = 1.20. 

The amount of proteins in the purified solution after the MCS chromatography step were 
assessed with a bicinchoninic acid (BCA) method (Thterchim) {see, e.g., 
10 http:/Avww.piercenet.com/files/bca.pdf) and electrophoresis. 

SDS-PAGE was done in accordance to Laenmili, Nature (1970) 227:680-685. Samples for 
SDS-PAGE were diluted to a protein concentration of 77 /ig/ml. Different protein 
concentrations, were loaded depending on the gel types used (10/12/15 Wells, Novex/Invitrogen): 



Number of Wells 


Protein Concentration in the Dilution 


, Load 


Protein/WeU 


10 WeUs 


77 Mg/nd 


20 Ml 




12WeUs 


77 Mg/ml 


15-20 Ml 


0.75 - 1 Mg 


15 Wells 


77Mg/inl 


lOjul 


0.5 Mg 



15 Samples for use in a reducing SDS-PAGE were prepared as follows: 





26 Ml sample or diluted sample 




+ 10 Ml NuPage Sample Buffer (4x) SDS NP0003 




+ 4 /il TCEP Bondbreaker Solution 77720 
(1:2 in MilliQ water) 


Final Volume: 


40 Ml 



The samples were heated for 10 minutes at 70°C or left at room temperature for 1 hour 
(leaving the samples at room temperature prevents the M protein of Corona Virus to 

20 coagulate/forming complexes), and then centrifuged for approximately one minute at 14,000 rpm 
in a table top centrifuge. 

Markers for use on the gel were prepared as follows. Gel bands containing less than 1 jug 
of proteins were easily visualised with the silver staining procedure using the Silver Staining Kit 
Protein, Plus One Staining Protocol (Pharmacia Biotech). 

i5 Western blotting was performed as follows. A semi-dry blotting technique was used to 

transfer the proteins from the SDS gel to a nitrocellulose membrane. The transfer was performed 
with a current of 0.8 mA/cm^ for 1 hour. A rabbit polyclonal antibody against SARS virus was 
used to perform the immuno probing using the Western Breeze, Novex Chromogenic Westem 
Blot Immunodetection Kit (Novex/Invitrogen). 

50 The chromatogram of the inactivated SARS MCS capture step is depicted in FIGURE 27. 

To estimate purity, MCS chromatography fractions were analysed by silver staining on NuPage 
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10% or 4-12% Bis-Tris-Gel (Novex) under reduced conditions, heated for 10 minutes at 70°C 
(Hgure 28). The fractions were also analysed under the same conditions by western blot (Figure 
29) to estimate purity, using PAK 11/03 SARS Gov 270603 neutralizing titer 1:512 (this 



antibody was used for this and subsequent western blots). Purity estimates are as follows: 



5 



Sample 


ydume 
/ml 


[Protein] 
//t35/ml 


Total Protein 


Step Recovery 
Protein / % 


Corona Harvest 


100 


2547.6 


254.76 


100 


After Filtration = Load 


100 


2440.3 


244.03 


95.8 


How Through 


85 


2321.4 


197.32 


77.5 


Wash 


49.32 


468.5 


23.11 


9.1 


Peakl 


12.12 


252.7 


3.062 


1.2 


Total Recovery 






464.4 


86.5 



B. Density Gradient Ultracentrifueation Step 

The eluted SARS virus fraction was then subjected to density gradient ultracentrifugation 
with a swinging bucket rotor to further purify the mactivated virus! 3 ml of MCS peak fraction 
were.loaded onto a Unear gradienT(i5-60% sucrose; 17 ml 15% and 17 ml 60% sucrose in 
10 gradient mixer). The separation was performed with a Beckman SW 28 rotor at 20,000 rpm for 
2 hours. 

The content of sucrose and protein in the. linear density gradient ultracentrifugation 
fractions are depicted in the following table, the graph in figure 30 and the estimation of purity in 
figure 31: 



15 



Fraction 


Fraction Size / ml 


[Sucrose] / % 


[Protein] / hs/ve1 


1 


2 


61 


96.12 


2 


2 


59.4 


98.62 


3 


2 


57.5 


87.63 


4 


2 


54.5 


86.91 


5 


2 


50.5 


79.9 


6 


2 


47.2 


74.3 


7 


2 


43.7 


68.05 


8 


2 


40.2 


60.43 


9 


2 


37.2 


57.38 


10 


2 


34 


53.12 


11 


2 


30 


50.63 


12 


2 


25.7 


35.02 


13 


2 


22.4 


35.33 


14 


2 


19.5 


39.25 


15 


2 


15.5 


69.79 


16 


2 


8.5 


169.03 


17 


2 


8.5 


128.96 



The protein concentration of fraction 11 (Figure 31 SDS-gel) was measured again against a 
standard curve prepared in 30% sucrose and lead to a protein concentration of 3.67 Aig/ml (0.05 
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jtig on the gel). The M protein appears to be missing in this preparation possibily due to sample 
treatment procedure (heated samples). 

There may be discrepancies in the protein concentration measurements in Table 2 due to 
sucrose interference with this assay. 

5 EXAMPLE 3 -Production, Inactivation and Purification of Whole SARS Virus Using MCS 
Chromatography Resin Purification Followed by Density Gradient UUracentrifiigation 

Inactivated SARS virus was prepared as described in Example above. 
A. MCS Chromatography Purification Step 

In this example, 200 ml of inactivated SARS virus harvest were subjected to MCS 
10 chromatography. The chromatogram of the capture step of inactivated SARS virus purification 
with MCS is depicted in FIGURE 32, the protein recovery in the following table and the 



estimation of purity in FIGURE 33: 





Volume 


[Protein] 


Total Protein 


Step Recovery 


Sample 


/ml 




/me 


Protein / % 


Corona Virus Harvest 


200 


2239.2 


447.83 


100 


After Filtration = Load 


200 


2245.1 


449.02 


100.3 


Flow Through 


185 


2126.3 


393.37 


87.8 


Wash 


49.32 


450.1 


22.2 


5.0 


Peak 1 


4.43 


1245.6 


5.52 


1.2 


Total Recovery 






421.08 


93.7 



B. Density Gradient Ultracentrifugation Step 
15 3.5 ml of MCS peak fraction were then loaded onto a linear gradient (15-40% sucrose: 16 

ml 15% and 16ml 40% sucrose in gradient mixer). The separation was performed with a 
Beckman SW 28 rotor at 20,000 rpm for 2 hours. 

The content of sucrose and protein in the linear density gradient ultracentrifugation 
firactions are depicted in the following table and the graph in FIGURE 34: 



Tube 


Fraction Size / ml 


[Sucrose] / % 


[Protein] / MS/ml 


1 


2 


40 


45.86 


2 


2 


39 


45.68 


3 


2 


37.5 


44.14 


4 


2 


35.5 


37.82 


5 


2 


33.5 


34.48 


6 


2 


31.5 


31.76 


7 


2 


30.5 


29.49 


8 


2 


28 


30.87 


9 


2 


25.5 


31:7 


10 


2 


23.5 


26.74 


11 


2 


21.75 


23.58 


12 


2 


20 


35.33 


13 


2 


18 


96.38 


14 


2 


14.5 


523.79 



-294- 



wo 2004/092360 



PCT/US2004/011710 



15 


2 


8 


941.97 


16 


2 


8 


696.7 



Protein recovery is shown in the following table and the estimation of purity is shown in figure 
35. Electron Micrograph pictures of density gradient jfractions 8, 9 and 10 are shown in figure 
36: 



5 



Step 


Volume / ml 


Protein / ns/xxd 


Total Protein /ms 


Step Protein % 


Load 


3.5 ml 


1245.6 


4359.6 


100 


Bulk Protein Fractions 


3.5 ml 


720.8 


4324.9 


99.2 


Viral Peak Fraction 


8ml 


29.7 


237.6 


53 


Total Recovery 






4562.5 


104.7 



EXAMPLE 4 - Mouse Immunization with Inactivated SARS Virus 

Mice were immunized subcutaneously on days 0, 14, and 28 with 5 fig BPL-inactivated 
SARS-CoV particles (BPL-SARS-GoV), either alone or together with Alum or MF59 as 
adjuvants. Serum was collected on days 0 (pre-immunization), 13 (post 1st immunization), 28 

10 (post 2nd), and 35 (1 week post 3rd immunization). Neutralizing antibodies were assessed for 
blocking SARS-CoV infection of Vero cells z/i vitro. After 3 immunizations, neutralization titers 
were in the range 1:100-1:1000, which are levels similar to those present in the serum of SARS 
convalescent patients. As shown in the following table, the non-adjuvanted vaccine induced 
neutralizing antibody after the third inmiunization, and potency of this vaccine was enhanced 

15 significantly by including the adjuvants, with neutralizing antibody appearing after then 2nd 
immunization and overall titers increasing after then 3rd immunization: 



Immunogen 


Neutralization Titer 


pre 


post 1st 


post 2nd 


post 3rd 


BFL-SARS-CoV+MF59 (5 nz) 


< 1:20 


<1:20 


1:158 


1:630 


BPL-SARS-CoV+Alum (5 mk) 


< 1:20 


<1:20 


1:67 


1:612 


BPL-SARS-CoV (5 hr) 


<1:20 


'< 1:20 


<1:20 


1:71 


PBS 


< 1:20 


<1:20 


<1:20 


<1:20 



EXAMPLE 5 - Balb/cMouse hnmunization with Inactivated SARS Virus 

A Balb/c mouse model for SARS infection has been developed (Subbarao et al (2004), 
20 J.Virol, 78:3572-77. In this model, Balb/c mice are inoculated intranasally with 10* TCID50 of 
virus. At 48 hours post-inoculation, a 2-log increase in the TCID50 virus titer can be delected in 
the lungs of infected mice. While virus replication is readily detected, the mice do not show any 
SARS disease symptoms and spontaneously clear the virus one week after inoculation. A 
decrease in virus titer in previously-immunized animals as compared to control animals 
25 demonstrates a protective effect of the vaccine being evaluated. 

In this example, four Balb/c mice per group are immunized three times with 5 /Mg BPL 
inactivated SARS-CoV (days 0, 14, 28) either alone or in combination with MF59 and 
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challenged with 10* TCID50 of SARS-CoV on day 43. Two days following viras challenge the 
mice are euthanized and SARS-CoV is quantified from nasal turbinates (NT) and lungs and the 
mean viras titer for each mouse is measured. Control groups received PBS alone, or an influenza 
viras vaccine (FLU) with or without MF59 adjuvant. Data were as follows (see also Figure 51), 
5 where four mice were tested per group and viras titers are expressed as :logio TCID50 per gram of 
tissue: 



Immunogen 


Vims replication in lungs of 
challenged mice 


Virus replication in nasal 
turbinates of challenged mice 


# infected/ 
# tested 


Mean (± SE) 
virus titer 


# infected/ 
# tested 


Mean (± SE) 
virus titer 


PBS 


4/4 


6.3 ± 0.3 


3/4 


2.8 ±0.35 


MF-59 alone 


4/4 


6.1+0.13 


3/4 


3.0 ± 0.38 


FLU vaccine (5 ^tg) 


4/4 


6.3 ± 0.07 


3/4 


2.9 ± 0.36 


FLU vaccine (5 fig) + MF-59 


4/4 


6.0 ±0.19 


4/4 


3.0 ±0.11 


BFL-SARS-CoV (5 ftg) 


•/4 


1.6 ±0.13* 


0/4 


Not detected ** 


BPL-SARS-CoV (5 /ig) + MF-59 


0/4 


Not detected * 


0/4 


Not detected " 



Two-tailed Student's t-test, compared to PBS-inununized mice, showed: * P<0.00001 or ** Pi=0.025 



. As shown, viras could not be detected in the BPL-SARS-CoV immunized mice. The lower 

10 limit of detection of infectious viras in a 10% w/v suspension of lung homogenate was 1.5 
logioTCIDso/gni, and in a 5% w/v suspension of nasal turbinates the limit was 1.8 
logioTCID5o/gm. Viral titers in the immunized mammals were thus below these threshold values. 

Thus the inactivated SARS-CoV vaccine was very efficient at preventing viras infection, 
as only one of eight mice immunized with the vaccine, either with or without MF59 adjuvant, 

15 was infected. Similar protection was not observed in control groups of PBS diluent, MF59 
adjuvant, or influenza viras vaccine with or without adjuvant. 

Neutralization titers of sera taken from the animals in the challenge study were assessed at 
two weeks post-lst, one week post-2nd, and one week post-3rd inoumunization. Mice immunized 
with the vaccine with MF59 adjuvant had already developed a neutralization titer of 1:71 after 

iO the 2nd immunization, which increased to 1:588 after the 3rd immunization, whereas mice 
receiving the unadjuvanted vaccine did not have any neutralizing activity post-2nd and a 
neutralization titer of 1 :64 post-3rd immunization. Sera from mice in each of the control groups 
did not show any neutralization activity. These data clearly demonstrate not only the ability of 
the inactivated SARS-CoV vaccine to induce protective levels of SARS neutralizing antibodies, 

15 but also a beneficial effect of formulating the vaccine with adjuvant for elevated neutralization 
titers. 

EXAMPLE 6 - Preparation of OMV comprising SARS viral antigens 

E.coli were transfected with a plasmid of interest (encoding a SARS viral antigen). Single 
colonies harbouring the plasmid of interest were grown overnight at 37**C in 20 ml of LB/Amp 
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(100 fig/ml) liquid culture. Bacteria were diluted 1:30 in 1.0 L of fresh medium and grown at 
either SO^C or ST^C until the OD550 reached 0.6-0.8. Expression of recombinant protein was 
induced with IPTG at a final concentration of 1.0 mM. After incubation for 3 hours, bacteria 
were harvested by centrifugation at 8 000 x 5 for 15 minutes at 4°C and resuspended in 20 ml of 
20 mM Tris-HCl (pH 7.5) and complete protease inhibitors CBoehringer-Mannheim™). All 
subsequent procedures were performed at 4°C or on ice. 

Cells were disrupted by sonication using a Branson Sonifier 450 and centrifiiged at 5 000 x 
g for 20 min to sediment unbroken cells and inclusion bodies. The supernatant, containing 
membranes and cellular debris, was centrifuged at 50000g (Beckman Ti50, 29 000 rpm) for 75 
min, washed with 20 mM Bis-tris propane (pH 6.5), 1.0 M NaCl, 10% (v/v) glycerol and 
sedimented again at 50000g for 75 minutes. The pellet was resuspended in 20mM Tris-HCl (pH 
7.5), 2.0% (v/v) Sarkosyl, complete protease inhibitor (1.0 mM EDTA, final concentration) and 
incubated for 20 minutes to dissolve inner membrane. Cellular debris was pelleted by 
centrifugation at 5000g for 10 min and the supernatant centrifuged at 75000g for 75 minutes 
(Beckman Ti50, 33000 rpm). Outer membrane vesicles were washed with 20 mM Tris-HCl (pH 
7.5) and centrifuged at 75 000 x g for 75 minutes or overnight. The OMV was finaUy 
resuspended in 500 nl of 20 mM Tris-HCl (pH 7.5), 10% v/v glycerol. Protein concentration 
was estimated by standard Bradford Assay (Bio-Rad), while protein concentration of inner 
membrane fraction was determined with the DC protein assay (Bio-Rad). Various fictions from 
the isolation procedure were assayed by SDS-PAGE. 

EXAMPLE 7 - Immunogenicity, dose and route schedule for recombinant Spike protein in 
mice 

The immunogenicity, route and dosing of the recombinant spike proteins of the invention 
in mice may be assessed using the below detailed protocol. Preferably, the administered antigen 
will elicit neutralizing antibody titers at least in the range of 1/100-1/1000. Increasing doses of 
antigen can be tested in the range from 5 to 20 |ig of recombinant Spike antigen alone or mixed 
with an equal volume of MF59-citrate, administered SC or IM to anesthetized mice in 100 \x\ of 
inoculum. Groups of B ALB/c mice, 6 per treatment are primed at day 0 and boosted at day 14 
and 28. 
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— ^Ts^ — 


Xr63tiiiGnt 




dstnpliiiQ intcrvQl 


NumbGr of mice 








7 21 35 42 d 


6 p6r dos6 Ibv6I 


4^6 


R6c-Spik8 protsin 


20, 10, 5 /vg/SC 


7 


6 psr dosG lovol 


7-9 


Rec-Spik6 protsin 


20, 10, 5 jug/IM 


7, 21, 35, 42 d 


6 p6r dos6 Isvol 


10-12 


RQC-Spik6 protGin 


20, 10, 5 fJQl\lJ[ 


7 


6 per dose level 








7 21 35 42 d 


6 per dose level 


lfl8 


R©c-Spik6 " MF59 


on in K nn/Qr* 




6 per dose level 





RGC-Spike - MF59 


«dU, lU, o //g/iM 


7 01 OK AO H 


6 per dose level 





Rec-Spike - MF59 


20 10 5 //g/IM 






25 


MF59 


NA/SC 


7, 21,35, 42 d 


6 + 6 (sac d 7 and 42) 


27 


MF59 


NA/IM 


7, 21, 35, 42 d 


6 + 6 (sac d 7 and 42) 


29 


Salins 


NA/SC 


7, 21 , 35, 42 d 


6 + 6 (sac d 7 and 42) 


31 


Saline 


NA/IM 


7. 21, 35.42 d 


6 + 6 (sac d 7 and 42) 



This protocol can also be used to assess the Thl/Th2 profile of the specific immune 
response elicited by the recombinant Spike protein. Neutralizing and Spike-specific antibody 
titers will be assessed at days 7, 21, and 35; IgG2a IgGl isotype of the Spike-specific . 

5 antibodies will be determined at days 21 and 35; in vitro proliferation of lymph node arid splenic 
T cells against the recombinant Spike protein will be determined at days 7 and 42, respectively; 
IFN-Y and IL-4 production by splenic T cell against the recombinant Spike protein from SARS- 
CoV will be assessed at day 42. Peripheral blood will be collected at days 7, 21, 35; lymph 
nodes cells at day 7, and spleen cells at day 42. Neutralizing and Spike-specific antibody titers 

10 and isotypes will be determined by inhibition of SARS-CoV infection of Vero cells and by 

ELISA, respectively. Proliferation of lymph node and splenic cells will; be determined by ^[H]- 
Thymidine uptake. Frequencies of splenic IFN-7 and IL-4 producing T lymphocytes, will be 
determined by ELISPOT and FACS. 

EXAMPLE 8 -Immunogenicity, dosing and route schedule for Spike proteins in rabbits 
15 The immunogenicity, route and dosing of the recombinant spike proteins of the invention 

in rabbits may be assessed using the below detailed protocol. Increasing doses can be tested in 
the range fijom 5 to 40 fig of recombinant Spike antigen alone or mixed with an equal volume of 
MF59-citrate, administered SC or IM to anesthetized animals in 200fJLl of inoculum. Groups of 
New Zealand white female rabbits, 10 per treatment, will be immunized as shown in the table 
20 below. The animals will be primed at day 0 and boosted at days 14 and 28. Peripheral blood 
will be collected at days 7, 21, and 35. Neutralizing and Spike-specific antibody titers will be 
determined by inhibition of SARS-CoV infection of Vero cells and by ELISA, respectively. 
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Group 


Treatment 


Dose/Route 




Number of rabbits 


1-4 


Full-length Spike protein 


40, 20, 10, Sjjg/SC 


^Tsti^ard*^^' 


10 per dose level 


5-8 


Full-length Spike protein 


40, 20, 10, Sfjg/IM 


7 2^' 35d 


1 0 per dose level 


9-12 


Truncated Spike protein 


40, 20, 10, 5//g/SC 


7 21 35d 


1 0 per dose level 


13-16 


Truncated Spike protein 


40, 20, 10, 5jug/IM 


7 21 35d 


1 0 per dose level 


17-20 


Full-length Spike protein - MF59 


40, 20, 1 0, 5//g/SC 


7 21 35d 


1 0 per dose level 


21-24 


Full-length Spike protein - MF59 


40, 20, 10, Spg/IM 


7 2 'ssd — ~ 


1 0 per dose level 


25-28 


Truncated Spike protein - MF59 


40, 20, 10, 5//g/SC 


7 21 35 d 


1 0 per dose level 


29-32 


Truncated Spike protein - l\/IF59 


40, 20, 10, 5//g/lM 


7, 21, 35 d 


1 0 per dose level 


33 


MF59 


NA/SC 


7, 21,35d 


10 


34 


MF59 


NA/IM 


7, 21,35d 


10 


35 


Saline 


NA/SC 


7, 21,35d 


10 


36 


Saline 


NA/IM 


7, 21,35d 


10 



EXAMPLE 9 - Immunogenicity and dose schedule for recombinant Spike in ferrets 

The immunogenicity and dosing of the recombinant spike proteins of the invention in 
ferrets may be assessed using the below detailed protocol. Three groups of ferrets, 6 for 
5 treatment, will be immunized with recombinant SARS-CoV Spike protein from CHO cell lines, ' 
alone or mixed with an equal volume of MF59-citrate, administered SC to anesthetized animals 
in 200/il of inoculum. The recombinant Spike protein vaccine will be tested at the dose eliciting 
the highest neutralizing antibody titers in mice at day 35 after the second boost. The animals 
will be primed at day 0 and boosted at day 14 and 28. Peripheral blood will be collected at days 
10 7, 21, and 35. Neutralizing and Spike-specific antibodies titers will be determined by inhibition 
of S ARS-CoV infection of Vero cells and by ELISA, respectively. 



Groups 


Treatment 


Dose/Route 


Sampling interval 


Number of ferrets 


1 &2 


Rec-Spike protein 


YA/gor2Y//g/SC 


7, 21,35d 


6 


3&4 


Rec-Spike protein -i- MFSO 


Y//gor2Y//g/SC 


7,21, 35 d 


6- 


5 


Saline 


NA/SC 


7, 21,35d 


6 



The 3 groups of ferrets, 6 animals per group, used for the immunogenicity studies above 
can then be used to assess efficacy of the recombinant Spike protein in protecting vaccinated 

15 animals from infection and/or disease. Anestethized animals will be challenged two weks after 
the last boost intratracheally with 10^ median tissue culture infectious dose unit (TCID50) of the 
SARS-CoV Utah strain. Infection by SARS-CoV will be assessed by taking nasal, faringeal and 
rectal swabs from animals for 20 days after challenge as described (12). The presence of SARS- 
CoV in sample materials will be assessed by RT-PCR and infection assay of Vero cells. 

20 Animals will be monitored for clinical signs of SARS disease by assessing sleeping time, 
temperature, respiratory symptoms, diarrhea, body weight and survival. Protection will be 
determined by the magnitude and duration of viras shedding and by duration and severity of 
disease symptoms and percentages of surviving animals. 
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EXAMPLE 10: Expression of Spike protein for vaccination 

The SARS-CoV Spike glycoprotein was expressed in both fiill-leiigth and truncated forms, 
using the nSh and nShATC pCMVm constructs described above, both with hexahistidine tags. 
The vector constructs were evaluated for expression 48 hr after transfection into 293 cells and 
5 COS7 cells. The full-length Spike protein (nSh) was detected by western blot only in cell lysate, 
but not in culture media (Figure 52). 

The majority of SARS-CoV full-length Spike protein was expressed in 
transiently-transfected COS7 cells as a high molecular glycoprotein which ran at 540 kDa in 
non-reducing gels (Figure 53). The gp540 is heat labile as indicated by the complete dissociation 

10 into monomeric forms (gpl70 & gpl 80) by boihng, but it was resistant to DTT treatment. These 
data suggest that the recombinant Spike protein is noncovalently associated into a homotrimer 
(gp540). The presence of Spike protein in homotrimeric association also was confirmed in 
inactivated, purified SARS-CoV vkion particles. Analysis of virion proteins by western blot 
under the same condition used for the characterization of recombinant Spike protein generated 

1.5 essentially identical results (Figure 54). 

EXAMPLE 11: Spike protein processing 

In order to characterize Spike protein processing. BHK-21 cells were infected with 
alphavirus replicon particles expressing the SARS-CoV full-length Spike. At 6 hoursr post- 
infection with an MOI of 5, infected cells were labeled for 1 hr with L-pS]methionine/cysteine 

10 and chased for up to 4 hours. The [^^S]-labeled spike protein was immunoprecipitated by anti- 
SARS rabbit serum and digested with Endo-H. Both digested and undigested proteins were 
analyzed by SDS-PAGE (4% polyacrylamide). As shown in Figure 55, the full-length spike 
protein is synthesized as an Endo-H sensitive high-mannose glycoprotein (gplTO, an ER form) 
that undergoes modification to an Endo-H resistant glycoprotein with complex oligosaccharides 

:5 (gplSO, a Golgi form). The conversion of gplTO into the gpl80 form takes place within 2 hours 
(Figure 56). 

EXAMPLE 12: High-level protein expression 

To develop a system for rapid expression of protein antigens, DNA transfection of 293 
(human embryonic kidney) cells was used, tp obtain milligram quantities of recombinant 
0 antigen. The most conraion method for culturing and transfecting 293 cells is in static or 

monolayer cultures. These procedures were modified by performing large-scale transfection of 
293 cells in suspension and expanding the transfected cells in suspension culture for production 
of secreted or intracellular proteins. Several initial experiments were performed at the 100- 
milliliter scale cultures to determine optimum conditions, such as number of cells, type of 
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transfecting reagent (FuGENE 6, Lipitoid or RO-1538) and the ratio of DNA to transfection 
reagent. Based upon pilot experiments, FuGENE 6 was the best transfecting reagent. 

The kinetics of gene expression was compared to other viral envelope glycoproteins, and 
the data suggest that stable protein expression peaks around 72 to 96 hours post-transfection, 
5 depending upon the gene of interest, and then significantly decreases thereafter. Thus, using the 
optimum conditions, the transfection process was scaled from 100 ml to 4 liters. The 4 liter 
culture can be used for rapidly producing 2-10 milligrams of protein antigens. To facilitate 
antigen purification and also msaimize the yield and recovery of the purified protein, 
transfection conditions were optimized by using serum-free medium. 
10 Bulk transfection procedure has been used for the expression of truncated and full-length ' 

Spike antigens. The kinetics of expression for truncated form of the spike protein is presented in 
Figure 56A. Expression of the truncated form of Spike protein peaked around 48 hrs and was 
stable until 72 hrs, therefore the cultures were, harvested at 72 hrs post transfection. 

Collected media were concentrated 20X and used for purification of truncated Spike 
15 protein by a very simple purification strategy where the truncated form of the spike was captured = 
on GNA lectin followed by DEAE and ceramic hydroxyapatite column chromatography. The 
purified protein was analyzed on SDS-PAGE by silver stain (Figure 56B) and also by western 
blot (Figure 56C). Early efforts were able to purify the truncated form of the spike protein with 
>95% purity and approximately 50% recovery. The molecular rriass of the truncated form of the 
20 Spike protein is approximately 170-180 kDa. 

Full-length Spike protein was expressed in 293 cells using the bulk transfection strategy. 
The expression data suggest that, like the truncated form, expression peaked around 48 hrs post- 
transfection and remained stable until 72 hrs. However, contrary to the truncated form and as 
expected, full-length protein is not secreted, but rather is retained within the cells, as shown by 
25 the absence of any signal in western blots of cell culture supematants. The full-length form of tiie 
protein was purified from Triton X-100 detergent-extiracted cells. Full-length Spike protein was 
then captured on GNA lectin, followed by hydroxyapatite and SP chromatography. The 
calculated molecular mass of full-length spike protein is approximately 600 kDa, which is close 
to the theoretical mass for the trimer. 

30 EXAMPLE 13: SARS virus seed cultures 

A S ARS-CoV reference seed virus propagated only in certified Vero cells will be used for 
the generation of the Master and Working Virus Seeds under GMP. A clinical specimen from the 
respiratory tract of a patient infected by the S ARS-CoV is inoculated onto documented VERO 
cells, with certified culture media. Culture media containing the virus are harvested at 4 days 

35 post-infection and designated Passage 1 (PI). A second round of virus propagation is again 
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peifoimed in certified VERO cells with certified media, by inoculation of 1 ml per T-75 flask of 
100 times diluted PI virus. Culture supernatant was harvested at 3 days post-infection and stored 
at -SO'^C as a P2 reference stock virus, without plaque purification. 

Cell banks of Vero cells for further production of SARS-CoV are prepared from specific 

5 cell subsets that have not been used since the emergence of transmissible spongiform 

enephalopathies (e.g. since 1980). A research cell bank of these cells has been prepared using 
specified New Zealand-origin fetal bovine serum. From this research cell bank, a Master Cell 
Bank (MCB) is made under GMP conditions and using only specified and well-controlled media 
and supplements. The cell bank will is tested for absence of adventitiouis agents according to 

10 applicable US, EU, and international guidelines (see Points To Consider "Characterization of 
cell lines used to produce biologicals", FDA/CBER 7/1993; ICH Q5D Draft 6 "Cell substirates", 
Oct.23, 1996; CPMP/ICH/294/95 "Note for Guidance on Quality of Biotechnological Products: 
Derivation and Characterisation of Cell Substrates Used for Production of 
Biotechnological/Biological Products. (Step 4, 16. July 97); WHO final draft "Requirements for 

15 use of animal cells as in vitro substrates for the production of biologicals" 7.3.1997). 
Tumorigenicity and identity testing is also required for this cell bank. 

The reference virus is plaque-purified and expanded in certified Vero cells in the absence , 
of PCS in order to generate Master arid Working Seeds. Another option to help ensure piirity and 
facilitate the assessment of safety of the Master Seed is to subject the SARS-CoV to pelleting 

10 and resuspension in PBS. The virus suspension is made up to 60% (w/w) sucrose with crystalline 
sucrose, transferred to a centrifiige tube and overlayed with 50, 40, 30, and 20% (w/w) sucrose 
solutions in PBS. The gradient is centrifiiged for 72h and then ft^actionated. The virus-containing 
fraction is diluted and the virions re-pelleted by ultracentrifugation. RNA from the virus pellet is 
isolated and transfected into certified Vero cells whereby the "infectious" positive-strand RNA 

IS will lead to the production of infectious virus, which can be plaque-purified and expanded to 
generate alternative Master and Working Seeds from purified virus RNA. 

Viral seeds are tested for the absence of adventitious agents (see e.g. 21 CFR Revised as of 
April 1, 1994, § 630.35 Test for safety) and for identity, using a highly-specific neutralizing 
antiserum prepared ftom an independent source. Safety testing of viral seeds for vaccine 

}0 purposes is done routinely by service laboratories. Broad-spectrum PCR testing can be used 9S 
an addition and/or alternative for testing. 

EXAMPLE 14: Scale-up of virus production and inactivation 

A protocol for the production, inactivation, and purification of inactivated SARS-CoV with 
sufficient structural integrity to elicit protective neutralizing antibody responses in animal 
J5 models involves: Vero.cells are infected with virus at an M.O.I. of 0.01 in the absence of PCS 
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and antibiotics; culture medium is collected, cleared by centrifiigation, and inactivated with BPL, 
followed by confirmatory testing for complete inactivation; the inactivated material is filtered, 
subjected to MCS-column purification, and further purified by sucrose gradient centrifugation. 
Several modifications and improvements can be developed when adapting this basic 
5 protocol to a larger scale for commercial use. Firstly, the cell culture and infection process can be 
adapted to roller botties, as an intermediate step to allow rapid production for preliminary trials 
within existing BSL 3+ facilities. Full commercial production will typically use a fennentation 
process in a closed system, but a roller bottie system can be achieved more rapidly. The roller 
botties do offer a true suspension culture system for Vero cells, which gives various technical 

10 and safety advantages over microcarrier cultures. Suspension cultures can be grown to any 

desired fermentation scale without interfering with the closed system between cell passages, as 
no trypsinization is required. 

To scale up the infection process in roller botties to 30-50 liters per batch, the optimum 
M.O.L and harvesting periods for selected media and culture conditions should first be 

15 determined. For the larger scale, methods for harvesting and handling larger volumes of highly 
infectious material safely should be used, and so cell separation via centrifugation should be 
replaced by a method such as filtration through single-use filter cartridges. 

The MCS-chromatography and the gradient purification steps described above can readily 
be scaled to a batch volume of up to 50 liters. For larger volumes, however, and for increased 

20 purity, ultrafiltration and sterile filtration steps will be used. Nuclease treatment to remove host 
ceU DNA will also be included. 

EXAMPLE 15: Large scale analytical methods 

Analytical methods for the SARS coronavirus include virus titration methods, 
immunological and physico-chemical metiiods to quantitate and characterize the purified antigen 
25 (ELISA, PAGE, western blots using specific antisera against purified whole virus, etc.). Other 
analytical tests include: fast yield testing via asymmetric field flow separation and laser particle 
detection and counting; Western blot using specific antisera against individual viral proteins; and 
tests for residual host cell DNA. 

Residual DNA testing is generally done by hybridization e.g. using a limit test. Such 
30 testing is performed according to methods already established and validated for other cell lines. 
As an alternative, the ThresholdT^* method may be used. 

For producing specific antibodies, recombinant protein expression of all the ORFs firom the 
stiuctural and non-stiiictiiral gene regions of die SARS-CoV is used. The ORFs can be cloned 
and expressed in E.coli and, if necessary, also in eukaryotic vectors such as baculovirus. This can 
35 provide sufficient amounts of purified soluble protein to inmiunize mice and rabbits to produce 
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polyclonal and monoclonal antibodies against SARS proteins and to set up specific ELJSA 
assays. Different expression vectors can be tested to maximize the yield of recombinant protein 
in a soluble form e.g. different vectors, one containing sequences coding for six N-terminal 
histidine residues and another containing a Glutathione-S-transferase protein fused to the 

5 C-terminus of the SARS protein. The recombinant proteins can be purified by single step column 
chromatography on either Nickel chelating Sepharose or Glutathione-Sepharose 4B resin. These 
procedures are very rapid and generally produce protein of 60-90% purity, which is suitable for 
raising specific antisera (Pizza et al. (2000) Science 287:1816-20). I^ive mice and two rabbits for 
each recombinant protein can be immunized SC with 20 and 50 jttg recombinant protein, 

10 respectively, given in IFA as adjuvant, at day 0, 14 and 28. Sera are collected at day 7, 21 and 35 
to assess specific titers before euthanasia of the animals for collection of blood and removal of 
spleens. 

For the detection of impurities {e.g. Vero cell derived proteins) in the vaccine preparation, 
rabbit serum reactive against Vero-derived proteins can be used. Such antisera are obtained by 

15 immunizing rabbits with at least lOjLig of Vero cell lysate with CFA/IFA. The sera can be 

verified for reactivity against Vero-derived proteins in western blots. For more specific antisera 
against specific relevant cell-derived proteins that tend to be co-purified with the virus, mock- 
infected cell culture harvest that have undergone the purification process can be prepared and 
used for immunizing rabbits. 

20 Methods to determine neutralization titers of sera from inununized animals and humans 

can be developed, without the constraints of using infectious SARS-CoV in a BSL-3+ 
laboratory. One such strategy will be to use recombinant antigens, particularly Spike protein or 
Spike-derived epitopes, and to develop ELISA assays for measuring antibodies against the target 
protein. Suitable epitopes allow a correlation to be established between the ELISA values and 

25 virus neutralization assay values. This approach provides a faster and more efficient (higher- 
throughput) comparison of specific and protective antibody titers. This ELISA test is also the 
ideal tool to monitor specific antibodies in safety trials, where several hundred animal sera must 
be tested. 

Another strategy is to combine structural elements from both the pathogenic SARS-CoV 
30 and the non-pathogenic coronavirus mouse hepatitis virus (MHV) to construct chimeric vinis- 
like particles (VLPs) that can be labeled. The assay is based on fusion between octadecyl 
rhodamine (R18)-labeled VLPs and cells (Hoekstra et al. (1984) Biochemistry 23:5675-81). The 
method relies on the relief of fluorescence self -quenching of R18 incorporated into VLPs upon 
fusion with cellular membranes. Coronavirus VLPs have been shown to mimic native virions 
35 with respect to their appearance in the electron microscope (EM) and their biological activities. 
As they do not contain viral RNA, however, then they cannot cause a productive infection 
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(Vennema et al. (1996) EMBO J 15:2020-2028). The VLP system can be used for the mouse 
hepatitis virus (MHV) strain A59 (MHV-A59) )Godeke et at. (2000) J Virol 74:1566-15) 
containing a chimeric S protein. The protein chimera, consisting of the ectodomain of the SARS- 
CoV and the transmembrane and endodomain (64 C-terminal amino acid residues) from the 
MHV spike protein, can be co-expressed with the MHV M (membrane) and E (envelope) protein 
in OST-7 cells )Godeke et al). VLPs secreted in the supernatant are harvested, purified and 
labeled with octadecyl rhodamine (R18) (Hoekstra et at). A constant amount of WLPs is 
incubated with a serial dilution of sera at 37°C for 1 hour in a 96-well plate. Subsequently, ceUs 
expressing the receptor for the SARS-CoV, the angiotensin-converting enzyme 2 (ACE2) (Li et 
al. (2003) Nature 426:450-54) is be added and the extent of fusion can be measured with a 
fluorescence spectrophotometer. 

A final strategy to monitor the ability of sera to inhibit cell-cell fusion interactions between 
ceUs expressing the SARS-CoV S protein and a human cell hne expressing the angiotensin- 
converting enzyme 2 (ACE2), a functional receptor for SARS-CoV (Li et al). This reporter 
gene-based assay uses the fluorescent shift (green to blue) of the fluorogenic substrate CCF2/AM 
(AM=acetoxymethyl) upon cleavage by P-lactamase (Bla) as read-out for ceU-cell fusion 
(Zlokamik et al. (1998) Science 279:84-88). For this assay, a BHK-derived cell line, stably 
expressing Bla and the SARS-CoV S protein is generated. In addition, a human cell line 
expressing ACE2 on its surface is used. BHK ceDs, expressing the S protein on their surface and 
Bla in their cytosol are incubated with serial dilutions of the sera to be tested for Ih at 37'='C. The 
ceU line expressing the ACE2 is loaded with 1/iM CCF2/AM for 1 h at 22°C, washed twice with 
PBS, and co-cultivated with the BHK cells. In case of cell-ceU fusion, Bla cleaves the substrate, 
resulting in a green blue shift with excitation at 409 nm. Inhibition of fusion by sera thus 
provides a detectable change. 

EXAMPLE 16: Stabilisation of inactivated SARS-CoV 

Although the purified inactivated SARS-CoV vaccine is capable of inducing potent 
neutralizing antibody responses in animals, it is relatively instable and can benefit from 
formulation to increase stability for an acceptable period of time. Suitable formulation changes 
include the use of various buffer systems, pH ranges, stabilizing excipients (e.g. sugars and 
sugar alcohols, amino acids, etc.) etc.. Stability testing can be conducted in real-time at normal 
storage temperatures, or can be conducted in an accelerated manner by using elevated 
temperatures. Vaccine stability can thus be increased to approximately one year or longer. 
Lyophilized vaccine formulation can also be used to extend shelf-life, possibly with further 
additives for stability during lyophilisation. 
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EXAMPLE 17: Dose and schedule optunizationfor inactivated virus 

Animal models of SARS-CoV infection have been reported, including mice, ferrets and 
macaques. As mentioned in example 4 above, mice immunized with the BPL-SARS-CoV 
vaccine achieve neutralizing antibody titers in the range of 1:100 — 1:1000, similar to levels 

5 found in convalescent patients, and are 100% protected from infection with a challenge virus. 
While the mouse challenge model is limited only to infection but not disease, ferrets and 
macaques are useful models of the human SARS disease. Two to four days after inoculation with 
SARS-CoV, both ferrets and macaques have been found to shed infectious SARS-CoV particles 
from the throat, nose and pharynx, as demonstrated by RT-PCR and/or virus isolation on Vero 

10 cells. At approximately the same time, the infected animals became lethargic, show respiratory 
distress and eventually die. Histologically, SARS-CoV infection in these animals associates with 
pulmonary lesions of different severity, similar to those found in biopsied lung tissue and 
autopsy material from SARS patients. With the availability of these models, preclinical smdies 
with vaccines can be performed initially in mice for inmiunogenicity readouts, while efficacy of 

15 optiinal doses and schedules can be assessed in the ferret and macaque models. 

Initial studies in mice are used to determine the optimal dose and schedule required to 
elicit the highest levels of neutralizing antibody, with titers at least in the range of 1/100 - 
1/1000. In parallel to the assessment of neutralizing activity, other features of tiie humoratl 
immune response and cellular immune responses can be investigated. In particular sera from 

10 immunized mice can be assessed for the isotype (IgGl v*. IgG2a) of the Spike-specific antibody 
response. Also, the frequencies of splenic CD4+ T cells producing IFN-y and JL-4 in response to • 
BPL-SARS-CoV particles wiU be assessed by ELISPOT and HLISA. These experiments can 
provide insight into the quality of the T cell response helping the priming of a protective 
antibody response. 

15 Increasing vaccine doses can be tested (e.g. from 5 to 20 /xg of BPL-SARS-CoV alone or 

mixed with an equal volume of MF59-citrate), administered SC to anesthetized mice in 100/il of 
inoculum. Groups of BALB/c mice, 10 per treatment, are inmiunized, with priming at day 0 and 
boosting at days 14 and 28. Secondary endpoints compare the kinetics of neutralizing vs. Spike- 
specific antibody titers and assess the Thl/Th2 profile of the specific immune response, and so 

30 neutralizing and Spike-specific antibody titers are assessed at days 7, 21, 35, and at 2, 3, 4, and 5 
months after priming. The IgG2a and IgGl titers of Spike-specific antibodies are determined at 
days 21, 35, and at 2, 3, 4, and 5 months after priming. Proliferation and IFN-y and JL-4 
production by splenic T cells against recombinant Spike protein from SARS-CoV are assessed at 
day 42, and at the end of the 5th month. Peripheral blood is collected at days 7, 21, 35, and at 2, 

35 3, 4, and 5 months after priming. Spleen cells will be obtained at day 42 and at the end of the 5th 
month. Neutralizing and Spike-specific antibody titers and isotypes are determined by inhibition 
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of infection of Vero cells and by EUSA, respectively. Proliferation of splenic cells is determined 
by ^|TH]-thymidine uptake. Frequencies of splenic IFN-y and IL-4 producing CD4* T 
lymphocytes is determined by ELISPOT and FACS analysis. 

Based on mouse results, the BPI^SARS-CoV vaccine can be tested in ferrets for the 
5 induction of protective neutralizing antibody titers. Ferrets are immunized according to a similar 
schedule as the mice and at the dose that elicits the highest neutralizing antibody titers in mice at 
day 35 after the second boost. Three groups of ferrets, 6 per treatment, are immunized with BPL- 
SARS-CoV, alone or mixed with an equal volume of MF59-citrate, administered SC to 
anesthetized animals in 200jul of inoculum. The animals are primed at day 0 and boosted at days 

10 14 and 28. Peripheral blood is collected at days 7, 21, and 35. Neutralizing and Spike-specific 
antibodies titers aie determined by inhibition of SARS-CoV infection of Vero cells and by 
EEJSA, respectively. Each group of ferrets is used to assess efficacy of the BPL-SARS-CoV in 
protecting vaccinated animals from infection and/or disease. Anesthetized animals are 
challenged intratracheally, two weeks after the last boost, with 10^ median tissue culture 

15 infectious dose units (TCID50) of the SARS-CoV CDC strain. Infection by SARS-CoV can be 

assessed by taking nasal, pharyngeal and rectal swabs from animals for 20 days after challenge > 
(Martina et at. supra). The presence of S ARSrCoV in sample materials can be assessed by RT- . 
PGR and infection assay of Vero cells. Animals can be monitored for clinical signs of SARS 
disease by assessing sleeping time, temperature, respiratory symptoms, diarrhea, body weight 

JO and survival. Protection can be determined by the magnitude and duration of virus shedding, by 

. ; duration and severity of disease symptoms, and by percentage of surviving animals. The 

formulation eliciting the highest neutralizing antibody titers at day 35 can then be tested against a 
two-fold higher dose of BPL-SARS-CoV given in the same formulation in the same regimen. 
Additional studies can evaluate immunogenicity and efficacy of the candidate vaccine in 

15 non-human primates. Three groups of adult cynomolgus macaques, 4 per treatment, are 
immunized with BPL-SARS-CoV, alone or mixed with an equal volume of MF59-citrate, 
administered SC to anesthetized animals in 500/tl of inoculum. The BPL-SARS-CoV vaccine 
can be tested at tiie dose eliciting the highest neutiralizing antibody titers in ferrets at day 35 after 
the second boost. The animals are primed at day 0 and boosted at 3 and 6 weeks. Peripheral 

JO blood is collected at weeks 1 , 4, and 7. A secondary endpoint is to assess die Th 1/Th2 profile of 
, die specific immune response. Neutiralizing and Spike-specific antibody titers and frequencies of 
peripheral blood CD4+ T cells producing IFN-7 and IL-4 in response to die recombinant SARS- 
CoV Spike protein is thus assessed at weeks 1, 4, and 7. Neutralizing and Spike-specific 
antibody titers can be determined by inhibition of SARS-CoV infection of Vero cells and by 

55 ELISA, respectively. Inti-acellular cytokine staining and FACS analysis will be used to quantify 
IFN-Y- and IL-4-producing CD4* T cells. The macaques can also be used to assess efficacy of 
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the BPL-SARS-CoV in protecting vaccinated animals firom infection and/or disease. 
Anesthetized macaques can be challenged two weeks after the last boost with 10* median tissue 
culture infectious dose unit (TCID50) of the SARS-CoV CDC strain in a S ml volume. A few 
drops of the virus can also be administered on each of the conjunctiva, 0.5 ml in the nose and the 

5 remainder in the trachea. Infection by SARS-CoV can be assessed by taking nasal, pharyngeal, 
and rectal swabs, and feces from animals for 20 days after challenge (Fouchier et al. (20030 
Nature 423:240). The presence of SARS-CoV in sample materials can be assessed by RT-PCR 
and infection assay of Vero cells. Animals can also be monitored for clinical signs of SARS 
disease by assessing sleeping time, temperature, respiratory symptoms, diarrhea, body weight 

10 and survival. Protection can be determined by the magnitude and duration of virus shedding, by 
duration and severity of disease symptoms, and by percentage of surviving animals. 



Mice 



Group 


Treatment 


Dose/Route 


Sampling interval 


Nmnber of mice 


1-3: 


BPL-SARS-CoV 


20, 10, 5 /ig/SC 


7, 21, 35 d; 
2. 3,4,5 m; 


10 per dose level 


4-6 


BPL-SARS-CoV 


20, 10, 5 n^SC 


42 d 


10 per dose level 


7-9 


BPL-SARS-CoV MF59 


20, 10, 5 ng/SC 


7, 21, 35 d; 
2, 3, 4, 5 m; 


10 per dose level 


10-12 


BPL-SARS-CoV MF59 


20, 10, 5 ng/SC 


42 d 


10 per dose level 


13 


MF59 


NA/SC 


7, 21, 35 d; 
2, 3, 4,5 m; 


10 + 10 (sacrificed at 
42 d and end 5 m) 


14 


Saline 


NA/SC 


7, 21, 35 d; 
2. 3, 4, 5 m; 


10 + 10 (sacrificed at 
42 d and end 5 m) 



Ferrets 



Group 


Treatment 


Route 


Sampling interval 


No. of ferrets 


1 


BPL-SARS-CoV 


SC 


7,21,35 d 


6 


2 


BPL-SARS-CoV-MF59 


SC 


7, 21, 35 d 


6 


3 


Saline 


SC 


7, 21, 35 d 


6 



15 



Macaques 



Group 


Treatment 


Route 


Sampling interval 


No. of macaques 


1 


BPL-SARS-CoV 


SC 


1,4. 7 w 


4 


2 


BPL-SARS-CoV - 
MF59 


SC 


1,4, 7 w 


4 


3 


Saline 


sc 


1,4, 7 w 


4 



EXAMPLE 18: Human T cell responses 

As a prelude to initiation of clinical studies in humans, the reactivity of peripheral blood T 
20 lymphocytes from healthy donors with different HLA haplotypes can be assessed using the in 
vitro priming technique (Abrignani et al. (1990) Proc Natl Acad Set USA 87:6136-40). The aim 
of this study is to have a first indication of the immune-dominant T cell epitopes in SARS-CoV 
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proteins. Briefly PBMCs from 20 healthy donors with different HLA haplotypes will be cultured 
in medium containing 5% autologous serum, in the presence of different concentration of SARS- 
BPI^CoV particles in the range from 0.5 to 20 fig/ml. The expression of activation markers will 
be assessed after 24 and 48 hours. Frequencies of IFN-y- and IL-4- producing T lymphocytes 
will be assessed after 12h and after 15 days in culture, in the presence of 100 U/ml recombinant 
human IL-2. Activated and cytokines producing CD4 T lymphocytes will be sorted and 
eventually cloned as single cells using FACS technologies. The CD4+ T cell repertoire from 
human subjects with different HLA will be assessed by proliferation assays of the CD4+ T cell 
lines and clones against autologous EBV~transformed cell lines loaded with 15-mer overlapping 
peptides from the most relevant structural and non structural protein of the S ARS-CoV. 

When moving to actual human trials, safety and immune responses will be evalOated in 
healthy adults following intramuscular immunization with escalating doses of the BPL- ' 
inactivated SARS-CoV vaccine, with MF59 adjuvant being included or omitted depending on 
preclinical data. Three/four immunizations will be given at 0, 1, 6 months in the first cohort, and 
at 0, 1, 2, 6 months and 0, 2, 6 weeks in the second and third cohorts respectively. The trial will 
be observer blind and placebo controlled. Subjects will be randomized into each dose level. 
Immune response parameters to be measured will include serum neutralizing antibodies, HLISA 
antibodies and peripheral blood IFN-gamma-producing CD4+ T cells by intracellular cytokine 
staiiiing. 



Group 


Antigen 
dose (pig) 


Administration 
schedule 


No. treated 
subjects 


No. subjects 
with placebo 


Sampling interval 


Al 


10 


0,1,6 months 


18 


6 


0, 1, 2, 6, 7 mos 


A2 


20 


0,1,6 months 


18 


6 


0, 1, 2, 6, 7 mos 


Bl 


10 


0,1,2,6 months 


18 


12 


0, 1, 2, 6, 7 mos 


B2 


20 


0,1,2,6 months 


18 


12 


0, 1,2, 6, 7 mos 


CI 


10 


0,2,6 weeks 


18 


12 


0, 2, 6, 10, 30 wks 


C2 


20 


0,2,6 weeks 


18 


12 


0,2,6. 10, 30 wks 



EXAMPLE 19: Selection ofCHO cell lines for Spike protein expression 

Methods for the derivation of Chinese Hamster Ovary (CHO) cell lines that stably express 
viral envelope glycoproteins that are conformationally intact, appropriately glycosylated and 
efficiently bind neutralizing antibodies are well established for HIV and HCV (Srivastava et al. 
(2002) J Virol 76:2835-47; Srivastava et al. (2003) / Virol 77:11244-259; Heile et al. (2000) J 
Virol 74:6885-92). The same techniques can be appUed to SARS-CoV, to generate two different 
stable CHOK-1 cell lines producing either full-length or truncated SARS Spike proteins. The 
Spike proteins can be expressed using the constructs described herein, but without the hexa-His 
tags. These proteins can compared for their ability to produce neutralizing antibodies in 
immunized animals as well as for their expression levels in CHOK-1 cells. 
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A pCMV3 vector expressing Spike can be used for the derivation of stable CHOK-1 cell 
lines, containing the CMV enhancer/promoter, ampicillin resistance, and a fused DHFR and 
attenuated neomycin gene for selection purposes. Stable cell lines can produced using the 
neomycin selection system in CHOK-1 cells. Clones can be sequenced to verify the integrity of 
5 the insert, and transient transfections can be performed using Trans-LTl polyamine transfection 
reagent (Pan Vera Corp., Madison, WI) to assess the expression level and also the integrity of the 
expressed protein by ELISA and western blot analysis. 

Initial CHO cells will be selected to be free from TSE/BSE contaminants and risks 
according to relevant regulatory standards. To construct cell lines, procedures involve 
10 transfection, primary screening with selective medium, followed by subcloning to assure purity 
of cell lines. Cell supematants can be assayed using an antigen capture ELISA to quantify 
expression levels at all stages of selection and amplification. For full-length Spike expression, 
methanol fixed cells can be screened for internal expression by immunofluorescent staining 
using a rabbit anti-SARS antibody. Successive measurements at the T75-flask stage of 
15 expansion canbe employed to assure stability of expression levels. The molecidar mass and 

integrity of the expressed proteins can be checked by PAGE both under native and reducing and 
denaturing conditions, followed by inununoprobing. 

The pCMV3 vectors expressing SARS-CoV Spike proteins in eithier fiill-length or 
truncated forms can be introduced into CHOK-1 cells using the Trans-LT-1 reagent and non- 
20 selective media. 24-48 hours post-transfection, depending on cell density, cells are split at a 1:5 . 
ratio and the medium can be changed to selective media containing neomycin at 500jLig/ml. Any . 
bovine serum used in these procedures will be from TSE-free sources that meet regulatory 
standards. Tert to fourteen days later, -individual colonies can be picked and transferred to 96 
well plates and cultured in complete non-selective medium. When approximately 80% of the 
25 wells are confluent, 24 hour supematants can be screened by Spike capture ELISA. For initial 
expression of full length Spike protein, cells can be fixed with methanol and screened by 
inomunofluorescent staining using a rabbit anti-SARS antibody. After low-expressing cell lines 
have been eliminated and there are fewer than 20-30 cell lines, capture ELISA and western blots 
can then be used to determine the expression level after cell lysis. A portion of each cell line can 
30 be pelleted, weighed and lysed in 1% Triton lysis buffer for determination of expression levels. 
Three to four clones producing the highest levels of spike protein in correct structure and 
conformation can be expanded to three-liter bioreactors and adapted to low serum suspension 
culture conditions for scale-up. 

The antigen capture ELISA assay for the SARS spike protein can be performed using 96 
35 well flat-bottom plates coated with 250ng per well of purified immunoglobulin obtained fh)m 
rabbit sera that were immunized with inactivated SARS vims. Supematant or lysate samples are 
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added and incubated for 2 hours at 37°C. Bound antigen is reacted against pooled S ARS*"* 
serum or high affinity monoclonal antibody either human or mouse against SARS spike protein 
and detected using appropriate species-specific peroxidase-conjugated second antibody. The 
plates are developed using TMB substrate (Pierce, Rockford, IL), read at a wavelength of 
450nm, and the concentration of protein per ml sample is derived from a standard curve (OD vs. 
protein concentration) based on serial dilutions of a known concentration of recombinant spike 
protein. 

The immunoprobing analysis will also be performed following the standard methods 
described by Srivastava et al. (2002) supra. Briefly, 10-20/il of the sample is analyzed on 4-20% 
SDS PAGE under non-reducing/denaturing conditions with mild heating. The proteins are then 
transferred onto nitrocellulose membranes and reacted against polyclonal anti-Spike rabbit 
serum, followed by anti-rabbit Ig conjugated to Alexa 688 (Molecular Probes, Oregon). The 
blots are scanned using an infrared imaging system. 

The highest expressing candidate cell lines will be screened for Spike protein expression 
and stability in small-scale (3 liter) perfusion bioreactors. The candidate clones will be further . 
evaluated for level of expression as well as integrity of expressed protein, and subsequently 
tested for expression stability in the absence of selection. The selected clones also will be tested 
for maintenance of the DNA sequence integrity of the integrated SARS spike protein gene. To 
quickly monitor the expression levels in small flasks and in the three liter evaluation cultures, a 
lectin-based process (Gluvanthus Nivalis lectin) has been developed to isolate SARS spike 
protein to a degree of purity that allows semi-quantitation and characterization of the protein in 
CHO supernatant. Full-length Spike protein will be obtained from Triton X-100 detergent 
extracted cells and then captured on GNA lectin, followed by hydroxyapatite and SP 
chromatograph. Eluted protein is then characterized by: (1) polyacrylamide gel electrophoresis 
(PAGE) and Coomassie staining, (2) immunoprobing with anti-SARS rabbit sera, (3) structural 
characterization using size exclusion chromatography (SEC), as well as mass spec analysis using 
MALDI-TOF. 

Productivity from the CHO cell line expressing SARS spike protein should be at least 2 
mg^ and for full-length Spike protein will be 3mg/100gm of cells, at steady-state cell density. 
Yield from one 45 day, 2.5-liter bioreactor will be -1000 mg crude protein. 

EXAMPLE 20: Purification of spike protein for human vaccines 

To purify SARS spike protein for the purpose of producing GMP grade material for human 
use, the following basic process is used, vwth all steps being perfonned at 2-8*'C: the starting 
material, concentrated CHO cell culture supernatant (20-30X) is thawed and filtered through a 
OASfim membrane; this material is heavily contaminated proteins from culture, as well as DNA; 
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the first purification step is affinity chromatography using Gluvanthus Nivalis (GNA), a lectin 
that preferentially recognizes terminal mannose containing carbohydrates; glycosylated proteins, 
including SARS spike protein are captured and non-glycosylated proteins, as well as DNA, do 
not bind to this column; the GNA column is followed by two chromatographic steps operated in 
the flow through mode; the anion exchanger, DEAE, and ceramic hydrpxyapatite (cHAP); 
DEAE binds some contaminating supernatant proteins and DNA, whereas cHAP binds any 
contaminating serum proteins; full-length Spike protein is purified from the cell pellet; tiie cells 
are lysed with Triton X-100 and full-length Spike protein is then captured on GNA lectin, 
followed by hydroxyapatite and SP chromatography. 

The purified SARS spike can be further treated to remove adventitious viruses: viral 
inactivation at pH 3.5 for 1 hour; the sample is then concentrated and diafiltered into a buffer at 
pH 4 and finally captured the purified protein using SP resin; the spike protein binds to .this resin 
and many viruses flow through. 

The spike protein is eluted, concentrated and diafiltered into formulation buffer. This 
formulated bulk product is then filtered through a DV50 viral removal membrane followed by 
filtration through a 0.2 fim membrane. The formulated bulk is filled into suitable containers e.g. 
into 3.0 ml vials, in a class 100 laminar flow hood. 

In process testing at each step of the purification includes protein concentration, endotoxin 
(LAL), bioburden, and recovery. 

Prior to human administration, a test for potency will evaluate the specific ability of the 
vaccine in an in vitro or in vivo test to effect a given response. The in vivo immunogenicity will 
be determined by dosing groups of 10 mice with various doses of the protein antigen. Sera will 
be analyzed for the presence of IgG antibodies using an ELIS A. The criterion for passing will be 
based upon the number of vaccine treated animals that are seropositive compared to a reference 
standard. Other tests include General Safety, sterility, purity, identity of the viaccine (using an 
ELISA specific for Spike protein), and quantity & protein concentration (UV spectrophotometric 
absorbance procedure based on the molar absorbance of the aromatic amino acids). 

Stability testing wiU be performed on the bulk drug substance and on the final container 
product. Bulk product will be evaluated at temperatures of — 60°C (recommended storage 
condition), 25 ±2 °C and 40 ±2 ''C protected from light, at time points of 0, 3, 6, 9, 12 months. 
Final container product will be tested at temperatures of -60 °C, and inverted at 5 ±3 °C, 25 ±2 
*'C, and 40 ±2 at time points of 0, 3, 6, 9, 12 months. Stability-indicating assays may include 
appearance, pH, protein content, SDS-PAGE, size exclusion HPLC, and container/closure 
integrity, performed on single samples of bulk and triplicate vials of final container material. 

The protein purified in this way can be evaluated in mice, rabbits and ferrets as described 
in, and based on the results of, examples 4, 5, 8 and 9 above. 
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Initial experiments will be perfonned in mice to deteimine optimal dose and schedule of 
the GMP Spike protein required to elicit the highest levels of neutralizing antibody, with titers at 
least in the range of 1/100 - 1/1000. Spike protein will be tested in the range from 5 to 40 (ig, 
alone or mixed with an equal volume of MF59-citrate, to anesthetized mice in 100/il of 
inoculum. Groups of BALB/c mice, 10 per treatment, will be immunized. The animals will be 
primed at day 0 and boosted at days 14 and 28. Secondary endpoints will be to compare the 
kinetics of neutralizing vs. Spike-specific antibody titers and to assess the Thl/Th2 profile of the 
specific immune response. Neutralizing and Spike-specific antibody titers will be assessed at 
days 7, 21, and 35 and at 2, 3, 4, and 5 months after priming; the IgG2a and IgGl titers of Spike- 
specific antibodies will be determined at days 21 and 35, and at 2, 3, 4, and 5 months after 
priming; proliferation and IFN-y and IL-4 production by splenic T cell against the recombinant 
Spike protein from S ARS-CoV will be assessed at day 42 and at the end of the 5th month. 
Peripheral blood will be collected at days 7, 21, and 35 and at 2, 3, 4, and 5 months after 
priming; spleen cells at day 42 and at the end of the 5th month. Neutralizing and Spike-specific 
antibody titers and isotypes will be determined by inhibition of SARS-CoV infection of Vero 
cells and by ELIS A, respectively. Proliferation of splenic cells wiU be determined by ^[H]- 
thymidine uptake. Frequencies of splenic IFNTy and IL-4 producing CD4+ T lymphocytes, will 
be determined by ELISPOT and FACS analysis. 

Next, the optimal dosing and schedule for recombinant Spike vaccine will be determined in 
ferrets. Based on the mouse results, the Spike vaccine eliciting the highest antibody neutralizing 
titers will be tested against a two-fold higher dose of recombinant Spike protein given in the 
same formulation. Three groups of ferrets, 6 per treatment, will be immunized SC under 
anesthesia with 200pil of inoculum. The animals will be primed at day 0 aind boosted at days 14 
and 28. Peripheral blood will be collected at days 7, 21, and 35. Neutralizing and Spike-specific 
antibodies titers will be determined by inhibition of SARS-CoV infection of Vero cells and by 
ELISA, respectively. Similar to the previous ferret studies, each group of animals will be used to 
assess efficacy of the vaccine in protecting immunized animals firom infection and/or disease. 

Lmnunogenicity and efficacy of the candidate vaccine also will be evaluated in nonhuman 
primates. Three groups of adult cynomolgus macaques, 4 per treatment, will be immunized with 
recombinant SARS-CoV Spike protein, alone or mixed with an equal volume of MF59-citrate, 
administered SC to anesthetized animals in 500 [il of inoculum. The Spike protein vaccine will 
be tested at the dose eliciting the highest neutralizing antibody titers in ferrets at day 35. The 
animals will be primed at day 0 and boosted at 3 and 6 weeks. Peripheral blood will be coUected 
at weeks 1, 4, and 7. A secondary endpoint will be to assess the Thl/Th2 profile of the specific 
inmiune response, as described above (neutralizing and Spike-specific antibody titers. 
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frequencies of peripheral blood CD4+ T cells producing IFN-7 and IL-4 in response to the 

recombinant Spike protein, assessed at at weeks 1, 4, and 7). 

Finally, human phase I, placebo-controlled, dose-escalation, safety/ immunogenicity trials 

will be performed for the IM recombinant S ARS vaccine with MF59 adjuvant. The trial will 
5 evaluate safety and immune responses in healthy adults following immunization with escalating 

doses of S ARS recombinant vaccine with MF59 adjuvant, administered intramuscularly. 

Three/four immunizations will be given at 0, 1, 6 months. The trial will be observer blind and 

placebo controlled. Subjects will be randomized into each dose level. Immune response 

parameters to be measured include serum neutralizing antibodies, ELISA antibodies and 
10 peripheral blood IFN-y-producing CD4+ T cells by intracellular cytokine staining: 



Group 


Vaccine 
Antigen 
dose (t^) 


Administration 
schedule 


No. of 

treated 
subjects 


No. of subjects 
with placebo 
(MF59) 


Sampling interval 


Al 


50 


0,1,6 months 


18 


6 


0, 1, 2, 6, 7 months 


A2 


100 


0,1,6 months 


18 


6 


0, 1,2, 6, 7 months 



EXAMPLE 21: Comparison of inactivated virus and purified Spike protein 

Immunogenicity and efficacy of the inactivated virus vaccine and the purified Spike 
protein can be compared in non-human primates. Three groups of adult cynomolgus macaques, 4 

15 for treatment, will be immunized with recombinant SARS-CoV Spike protein from CHO cell 
lines or with BPL-SARS-COV, given in the dose and formulation eliciting the highest 
neutralizing antibody titers in previous immunogenicity challenge experiments, administered SC 
to anesthetized animals in 500^1 of inoculum. The animals will be primed at day 0 and boosted 
at 3 and 6 weeks. Peripheral blood will be collected at weeks 1, 4, 7. A secondary endpoint wiU ■ 

20 be to assess the Thl/Th2 profile of the specific immune response, as described above. 



Group 


Treatment 


DoseARoute 


Sampling interval 


No. of macaques 


1 


Rec-Spike protein 
+ or - MF59 


Yfig/SC 


1,4, 7 w 


4 


2 


BPL-SARS-CoV 
+ or-MF59 


Y/ig/SC 


1,4, 7 w 


4 


3 


Saline 


NA/SC 


1,4, 7 w 


4 



EXAMPLE 22: Expression in yeast 

Yeast is a useful and inexpensive eukaryotic expression system. Yeast-expressed proteins 
are used in recombinant hepatitis B virus vaccines, and recombinant SARS antigens may also be 
25 expressed in yeast for vaccine purposes. Yeast-expression is also convenient for the production 
of antigens for preparing monoclonal and polyclonal antitobodies, or for use in serological 
assays. 
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The nucleocapsid protein (N) and two different versions of the spike glycoprotein (S) from 
SARS coronavirus FRA strain (AYS 10120) were cloned for expression in S.cerevisiae: 

SARS N: aa 1 -422 (coordinates 28120-29388 of AYS 10120 strain) - Fig.65 

SARS spike: aa 14 - 1195 (transmembrane domain and cytoplasmic tail deleted) - Fig.66 

SARS spike: aa 14 - 662 (SI domain) 
To make the SI construct, a XhoI-NotiE fragment of approximately 373Sbp encoding the full- 
length spike glycoprotein was the starting point. PGR was used to amplify the full-length gene 
in two pieces: Xbal-Blnl of 2440bp and Blnl-Sall of 1306bp. These fragments were subcloned 
into conmiercial vectors (Novagen): pT7Blue2 Xbal-Blnl (5* end of spike glycoprotein) and 
pT7Blue2 Blnl-SaU (3' end of spike glycoprotein; Figure 58), respectively. The following 
primers were used in the subsequent PGR reactions: Spk-1 (5') SEQ ID NO: 9785; Spk-2 (5')' 
SEQ ID NO: 9786; Spk-3 (5') SEQ ID NO: 9787; Spk-4 (5') SEQ ID NO: 9788. 

E. coli HBlOl conipetent cells were transformed with the PGR ligation product and plated ' 
on Luria agar plates, containing lOOjiig/ml ampicillin. The desired clones were identified using 
miniscreen DNA analysis. After sequence verification and plasmid amplification of the desired ' 
subclones, it was desirable to eliminate the internal Sail site present in the Xbal-Blnl portion of 
the spike sequence in order to facilitate future cloning into the yeast expression vector (BamHI- . 
Sail). Therefore, we prepared a Celll-Mfel vector from the pT7Blue2 Xbal-Bbil (5* end Spike) 
subclone to eliminated a 143bp sequence containing the SaU site. Kinased oligos DSl-6 (SEQ 
ID NOS: 9789-9794) were then ligated into the GelH-Mfel vector to replace the 143bp that were • 
removed to mutate the Sail site (no aa changes), creating pT7Blue2!xbaI-BlnIAsaI. 

The 5' Xijal-Bhil (from pT7Blue2.XbaI-BlnI ASal) and the 3' Blnl-Sall (from pT7Blue2 
Blnl-Sall) spike glycoprotein inserts were gel-purified and ligated them into the p893-l Xbal- 
SaU vector (a vector derived from pLitmus 38 (New England Biolabs) with the alpha-factor 
leader sequence cloned into the BamHI -SaU sites of the MGS). The resulting full-length SARS 
Spike coding sequence was named p893-l.SARS Spike 1255 #9 (Figure 58). 

E.coliHBlOl competent cells were transformed with the oligo replacement ligation 
product and plated on Luria agar plates, containing lOOjug/ml ampicillin. The desired clones 
were identified using miniscreen DNA analysis. After sequence verification of the positive 
clones, pT7Blue2 Xba-Bln ASal was chosen for use as a template for PGR reactions to amplify 
the Spike SI 1967 bp Xba-Sal fragment. The fragment was then subcloned into the p893-l Xba- 
Sal vector, sequence verified, and named it p893-l. Spike SI #1 1 (Figure 59). 

In order to clone into the S.cerevisiae expression vector, pBS24.1, the 5' end of the SI 
sequence had to be modified from Xbal to Hindm to allow ligation with the 3' Hindni end of the 
ADH2/GAPDH BamHI-Hindin promoter fragment. From pT7Blue2 Xba-BlnASal (described 
above) an Agel-Salt 1943bp fragment was gel-purified. This firagment was ligated along with a 
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synthetic pair of Hindlll-Agel 30bp kinased oligos (Sl-l+Sl-2 creating the necessary 5' Hindm 
site) into the pSP72 Hindlll-Sall commercial subcloning vector (named pSF72.SARS Spike SI 
#2; Figure 59). Sl-1 had SEQ ID NO: 9795 and Sl-2 has SEQ ID NO: 9796. 

After sequence verification of the positive clone from miniscreen DNA analysis, the 
5 Hindm-Sall fragment was gel purified. The 1365 bp BamHI-Hindin ADH2/GAPDH promoter 
fragment was ligated along with the 1973 bp Hindlll-Sall SI fragment into the pBS24.1 BamHI- 
Sall vector creating the genetically engineered pd.SARS Spike SI #2 expression plasmid (Figure 
60). 

S.cerevisiae strain AD3 was transformed with pd.SARS Spike SI #2 and single 
10 transformants were checked for expression after depletion of glucose in the medium. The 
recombinant protein was expressed at high levels in yeast, as detected by Coomassie blue 
staining. In particular, yeast cells were transformed with the SARS SI expression plasmid using 
the Invitrogen S.c. EasyCorhp™ Transformation Kit. Expression in shown in Figure 57. 

To express Spike 1195 protein, which does not contain the trans-membrane (TM) region or 
15 cytoplasmic tail that are present in the full-length SARS construct, the following series of genetic 
manipulations was performed: 

From pT7Blue2 Blnl-Sall #11 (described above) a Blnl-Dral 1056bp fragment was gel 
purified. This fragment was ligated with a synthetic pair of 68bp Dral-Sall kinased oligos 
. (DRSl+2; SEQ ID NOS: 9797 & 9798) into a pT7Blue2 Blnl-SaU vector (Figure 61). E.coli 
ZO HBlOl competent cells were transformed with the oUgo replacement ligation product and plated - 
on Lima agar plates, containing lOO/tg/ml ampicillin. The desired cloiies were identified using 
miniscreen DNA analysis. After sequence confirmation the clone was named pT7Blue2 Blnl- 
Sal Spike 1 195 #7. The 1 126bp Blnl-Sall fragment encoding the 3' end of the Spike 1 195 was 
gel purified (Fig.61). 

25 In order to generate the Xbal-Sall Spike 1 195 fragment, the 3 109bp Xbal-Pcil fragment 

was isolated from the p893-l.SARS Spike 1255 #9 (described above) and a 457bp Pcil-Sall 
fragment firom pT7Blue2.SARS Spike 1195 #7 (described above). The two fragments were 
cloned into the p893-l Xbal-Sall vector, creating the p893-l .SARS Spike 1 195 #34 plasmid 
(Figure 62). 

30 To clone SARS Spike 1 195 into the pBS24. 1 Saccharomyces cerevisisae expression 

vector, it was necessary to modify the 5' end of the SARS Spike 1195 from Xbal to Hindin, as 
done for the Spike SI expression clone described above. To begin, the 2416bp Agel-Blnl 
fragment was isolated from p893-l.SARS Spike 1195 #34. This fragment was ligated with the 
synthetic Hindlll-Agel 30bp oligos (described above to generate the SI protein for expression in 

35 S.cerevisiae) into the pT7Blue2 HindQI-Blnl vector. E. coli HBlOl competent cells were 
transformed with the oligo replacement ligation product and plated on Luria agar plates, 
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containing lOOjug/ml ampicillin. The desired clones were identified using miniscreen DNA 
analysis. After sequence verification of the positive clone and plasmid amplification of 
pT7Blue2.SARS 1195 5' Hindin-Blnl #10 (Figure 63), we isolated a 402bp Hindm-Ncol 
fragment and the 2044bp Ncol-Blnl fragment (Figure 63). It was necessary for the Hindlll-Blnl 
isolation to be done in two steps to avoid cloning issues related to the internal Hindm site 
located at nucleotide number 1319 of the spike 1195 protein. 

To assemble the BamHI-Sall-expression cassette of Spike 1195 into the pBS24.1 vector 
Kcoli HBlOl competent cells were transformed with the the BamHI-Hindin (ADH2/GAPDH 
promoter), Handni-Ncol 402bp fragment, Ncol-Blnl 2044bp and the Blnl-Sall 1126bp fragments 
into the pBS24.1 BamHI-Sall vector. The samples were plated on Luria agar plates, containing 
100|ig/ml ampicillin. The desired clone was identified using miniscreen DNA analysis, thus 
creating the genetically engineered pd.SARS Spike 1195 #10 (Figure 64). 

■■ S.cerevisiae stirain AD3 was tiransformed with pd.SARS Spike 1 195 #10 and single 
bransformants were checked for expression after depletion of glucose in the medium. The 
recombinant protein was detected by Coomassie blue staining. In particular, yeast cells were 
transformed with the S ARS 1 195 expression plasmid using the Invitrogen S.c. EasyCompT™ 
Transformation Kit. 

EXAMPLE 23: Expression in mammalian cell lines 

cDNA fragments containing the S protein ORF of 1255 amino acids were ampHfied by 
RT-PCR from SARS viral RNA (Frankfurt isolate) grown in Vero cells. The ampHfied PGR 
fragments were cloned into pBlueScript vector, sequenced, and consensus spike sequence was 
assembled to create a fiill-length SARS spike clone, pBSnSh. In vitro transcription of pBSnSh 
followed by translation in a rabbit reticulocyte lysate resulted in the pix^duction of single 
polypeptide with an estimated molecular mass of -140 kDa. 

The insert of this plasmid was recloned via Xhol and Not I into a mammlian expression 
vector pCMVm (Srivastava etal. (2003) J. Virol 77:11244-11259) to create a constiiict, nSh 
(Fig. 74A). A PGR fragment containing a spike protein of 1195 amino acid, which was deleted 
for transmembrane (TM) domain and cystein-rich cytoplasmic tail (Cy) was amplified and 
cloned pCMVm vector to generate the contstiuct nShATC (Figure.74B). Both constoiicts were 
tagged with six histidine residues at the C-terminus in order to aid in their characterization. The 
Xho I/Not I fragment without a histidine tag also was subcloned into the alphavirus replicon 
vector backbone pVCRchim2.1 for use in the production of an alphavirus replicon particle 
chimera that expresses S protein. Production and characterization of the replication defective 
alphavirus vector particles was performed essentially as described previously (Perri et al. (2003) 



-317- 



wo 2004/092360 



PCT/US2004/011710 



J. Virol 77:10394-10403; Polo et oL. (1999) PNAS USA. 96:4598-4603). The resultant alphaviras 
vector particles were named as VEE/SIN. 

COS? cells and BHK-21 cells were maintained in Dulbecco's modified Eagle's medium 
supplemented with 10% fetal bovine serum at 37°C and 5% CO2 in air. COS7 cells were 

5 transfected with expression plasmids (nSh, nShATC) using a transfection kit (TransIt-COS, 
Minis) following the manufacturer's protocol. The cells were washed once with ice-cold PBS 
and lysed with Ix Lysis buffer (20mM MOPS, lOmM NaCl, 1.5mM MgClz, and 1% Triton X- 
100) containing complete mini protease inhibitor (Roche). After a 30-min incubation on ice, the 
debris was cleared by centrifugation. The cleared lysate was either puriified or used directly in 

10 western blotting. 

To purify secreted spike proteins, medium from transfected cells was collected and 
subjected to centrifugation at 12,000 rpm for 10 min to remove cellular debris. The cleared 
medium was applied to a ConA-agarose column (Vector Lab). The column was washed 
extensively with 20mM sodium phosphate buffer, and then the bound proteins were eluted with 

15 IM methyl a-D-mannopyranoside (MMP), IM NaCl in 20mM sodium phosphate buffer. 
Column fractions containing SARS-CoV spike proteins were applied to MagneEis Protein 
purification system (Promega) following the protocol suggested by the manufacturer. 

For western blot analysis, proteins were separated by 4-20% SDS-PAGE and then 
transferred electrophoretically to nitrocellulose membrane (Invitrogen). Membrane was blocked 

10 in blocking buffer (5% skim milk and 0.1% Tween 20 in PBS) and incubated with indicated 
antibody at room temperature for 1 hr, washed and probed with horseradish peroxidase (HRP)- 
conjugated secondary antibody (Biosource) followed by chemiluminescence (ECL system, 
Amersham) and exposed by X-ray films. The antibodies used were a mouse monoclonal anti- 
histidine antibody (anti-His»tag Mab, Novagen), a rabbit polyclonlal antipeptide antibody against 

25 SARS-CoV spike proten (SmPab, Abgent), or rabbit anti-SARS sera (2BE) obtained by 
immunization of rabbits with purified SARS-CoV virion. The latter has a cell culture 
neutralizing titer of 1/2,500. Unless stated otherwise, antibody was used at 1/1,000 for anti- 
histidine antibody and SmPab and 1/10,000 for anti-SARS rabbit sera. 

Some spike proteins were treated with Peptide-N glycosidase F (PNGase F). CeU lysates 

30 were diluted in 0.5% SDS and 1% 3-mercaptoethanol and denatured at 100°C for 10 min. After 
2-fold dilution with 1% NP-40 in 50mM sodium phosphate (pH 7.5), the samples were treated 
with PNGase F (NEB) at 37°C for 1 hr. Enzyme-treated samples were analyzed by 4-12% SDS- 
PAGE in reducing condition. For a partial digestion with the PNGase, the cell lysates were 
diluted with 50mM sodium phosphate (pH 6.0) containing 0.75% Triton-X and treated with 

35 PNGase F (Calbiochem) at 37°C for 3 hr. Enzyme-treated samples were analyzed by 4-20% 
SDS-PAGE in nonreducing condition. 
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Western blots of cells 48-hours after transfection are shown in Figure 75. The S protein 
was detected in cell lysates as a doublet with estimated molecular weight of -170 -180 kDa , 
when the lysate was boiled and analysed under reducing SDS-PAGE conditions (Fig. 75A, lane 
3). This doublet appears to result from differential glycosylation of one polypeptide product 
since pre-treatment of the cell lysate with PNGase F reduced the doublet to a single species of 
-140 kDa (Fig. 75A, lane 4). This is the expected size predicted from the aa sequence for a full- 
length, intact polypeptide product. This experiment indicates that the full length SARS-CoV S is 
expressed in mammalian cells as a single, uncleaved polypeptide, but in two differentially 
glycosylated forms, gpl70 and gplSO respectively. Unlike the two S glycofonns encoded by the 
fuU-length sequence, none of which were secreted, the S A protein product was detected both in 
ceU lysates (Fig. 75A, lane 5) as weU as in the cell culture medium (Fig. 75B, lane3) as a single 
species of - 160 kDa. 

In order to further characterize the intracellular processing of the S protein, and as 
described above, BHK21 cells were infected with defective alphavirus particles expressing the 
full-length S. At 6 hr post infection with a MOI of 5, infected cells were pulse labeled for 1 hr 
with Lrf^S] methionine/cysteine and chased for 2 or 4 hours. The [^^S]-labeled S protein was 
inomuno-precipitated using the rabbit antiserum raised against inactivated, purified virus and 
then digested with Endo H. The Endo H treatment involved dilution with a sample buffer (50mM 
sodium phosphate. 0.1% SDS, 50 mM DTT, pH 6.0) and boiUng for 5 min. After denaturation. • 
the samples were further diluted with 0.75% Triton-X 100 and treated with endoglycosidase H 
(Endo H) following manufacturer's protocol (Calbiochem) for 3 hr at 37°C. Enzyme-treated • 
samples were added with gel loading buffer containing 0.1% SDS and DTT and analyzed by 8% 
SDS-PAGE. 

Both digested and undigested proteins were boiled in SDS and analysed by reducing SDS- 
PAGE (Figure 55). After a 1-hr pulse, the S protein was apparent as a single gpl70 component 
that was Endo H sensitive Ganes 1 and 2). After a 2-hr chase, a new species (gpl80) was present 
along with gpl70 in approximately equal proportions (lane3). After a 4-hr chase, the gpl80 
species was the dominant S protein component Qane 5) that was now Endo H resistant (lanes 5 
and 6). This data is consistent with gpl70 being an ER-resident glycoprotein containing high 
mannose chains and with gpl80 corresponding with a Golgi-piocessed glycoprotein containing 
Endo H-resistant complex oligosaccharides. 

The Endo H sensitivity of the C-terminus deleted S A protein purified from cell culture 
media ws also tested. As shown in Figure 76. the SA observed within cell lysates was found to be 
Endo H sensitive (lanes 1 and 2), while the secreted SA in cell culture media was Endo H 
resistant (lanes 3 and 4). This result is consistent with this glycoprotein being synthesized in an 
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immature form in the ER prior to transfer to the Golgi where the complex carbohydrate is added 
and the protein then secreted. 

As akeady described, the S protein expressed in COS7 cells was detected as a 
gpl70/gpl80 doublet in western blot analyses of cell lysates that were fully denatured by boiling 
5 in the presence of DTT. However, the majority of S protein was detected as a high molecular 
glycoprotein in the 440-669 kDa range when the same cell lysate was not heat-denatured prior to 
western blot analysis using SDS-PAGE (Fig. 77, lane 1). The -500 kDa species was resistant to 
10 mM DTT treatment (lane 3) and not dissociated into the monomeric form unless the lysate 
was first heat-denatured at 100°C (lane 4). In contrast, oligomeric form of a test protein 

10 (Thyroglobulin) of which quaternary structure is held by disulfide-linkage was converted into 
subunit form by the 10 mM DTT treatment. These data suggest that the ~500 kDa oligomeric 
form of S protein is not disulfide-linked and is heat labile. To confirm the heat-sensitivity of the 
-500 kDa species of S protein, the heat-denaturation experiment was repeated but without DTT. 
As shown in Figure 78, heat denaturation of -500 kDa protein at 100°C alone was sufficient to 

15 convertit into gpl70/180 monomeric forms (lane 4), Using a 80°C heat-denaturation step, both . 
the -500 kDa and monomeric forms were detectable in similar proportion (lane 3). 

In order to investigate further whether this -500 kDa species represents an S protein 
oligomer in native conformation, comparative analyses with virion-derived S glycoprotein 
derived from Vero cell cultures weis performed. The purified virions were solubilised in 1% SDS 

20 prior to Western blot analyses after SDS PAGE. The presence of the -500 kDa spike protein 
oligomer was confirmed in virion particles (Fig. 79, lane 1). In addition, heat denaturation of 
solubilised virions produced the same oligomer-to-monomer conversion as seen with the full- 
length recombinant S (lanes 2,3). The oligomeric nature of virion S was further analysed in a 
cross-linking experiment. Aliquots of inactivated virion from sucrose gradient fractions were 

25 treated with 10% SDS at 1% final concentration and diluted 2-fold with 0.2M Triethanolamine- 
HCl (pH 8, Sigma); Dimethyl suberimidate (DMS; Pierce Chemical Co.) was then added fi-om a 
freshly prepared solution (lOmg/ml in 0.2M Triethanolamine-HCl) at 3:3mg/ml final 
concentration. After 2 hr at room temperature, samples were concentrated with Centricon-30 and 
analyzed by silver staining after electrophoresis on a 4% polyacrylamide gel. Both untreated and 

30 DMS cross-linked virion proteins were heat-denatured, and the heat effect on the maintenance of 
oligomer structure was analysed by SDS-PAGE and silver staining (Figure 80). In the absence of 
cross-linking, heat denaturation resulted in the replacement of the ~500 kD spike protein species 
with the monomer species. In contrast, in the cross-linked proteins, the levels of the -500 kD and 
monomer species did not change significantly after heating. These data support the fact that the 

35 -500 kD protein is an oligomer of S monomer proteins that are bound lion-covalently. After 
cross-linking and boiling, the -500 IdDa species migrated as a somewhat slower diffuse form 
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than the untreated fonn. This mobility shift is probably due to a structural change resulting from 
boiling. In addition, a minor protein species of -300 kD, which may represent a non-dissociated 
S dimer, could be seen. 

To estimate more precisely the size of the recombinant -500 kDa S species expressed in 
COS7 cells, a COS7 cell lysate containing the S protein oligomer was fractionated using size- 
exclusion column chromatography. The major portion of the -500 kDa oligomer co-eluted with a 
572 WDa marker protein. Taken together, these experiments suggest that the -500 kDa S species 
seen in COS7 cell lysates is probably a homotrimer of the S protein monomer. 

The oligomeric status of the SA spike protein was also examined after expression in COS7 
cells. As shown in Figure 81, the recombinant SA proteins present in cell lysates were also 
detected in high molecular weight forms of -500 kDa range when the lysate was not heated prior 
to SDS-PAGE and Western blot analysis (lane 1). However, the efficiency of oligomerization by 
intracellular SA protein appears to be much less (<10%) compared to that of full-length S protein 
under the same western analysis conditions. A heat-sensitivity test on this -500 kDa protein 
showed that the SA oligomer was more heat labile than that of the full-length ,S oligomer, as 
demonstrated by the >90% conversion of all of the -500 kDa species into monomeric Sd forms 
at 80°C (lane 2). Also (Figure 82), the majority of die secreted SA protein was found in 
monomeric form with the -500 kDa species barely detectable (and only, detectable when ihe 
protein was loaded in excess for Western analysis) (lane 1). At a temperature above SO°.C, all 
secreted SA proteins were detected as monomers (lanes 2, 3). 

The ~500kDa protein is glycosylated, and the effect of deglycosylation on its antibody 
binding was investigated. The recombinant COS7 lysate was treated with PNGase F under non- 
denaturing condition (as described above) and analysed by western blot. As shown in Figure 83, 
deglycosylation did not affect the binding of anti-histidine Mab antibody to the treated S 
oligomer (lanes 2,3). However, it compromised the reactivity with the rabbit antisera raised 
against purified virus Gane 6). This antiserum binds to virion-derived S in westem blot analyses 
only when DTT is omitted from the sample for SDS-PAGE indicating that it recognizes 
primarily a discontinuous, conformational epitope(s). This antisera has also been shown to have 
a high-titer of viral neutralizing antibodies. Its lack of binding to deglycosylated, recombinant S 
suggests that the carbohydrate actively contiibutes to the higher order, native structure of the S 
polypeptide oligomer. 

The difference between the recombinant S and S A protein is the presence or absence of the 
TM-and Cys-rich domains at the C-terminus. This difference predicts that full-length S would be 
found associated with the membrane fraction while Sd would be in the soluble fraction upon 
lysis of tiransfected cells. Therefore, nSh- or nShATC-transfected cells were lysed under 
hypotonic conditions and the soluble cytosol fraction was separated fi-om the insoluble 
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membrane fraction by centrifugation (Figure 48). As shown in Kgure 84, the S protein was 
found in the membrane fraction (DF) both as a -500 kDa and 180/170 kDa species (lane 4) but 
was not detectable in the soluble cytosol fraction (AF) (lane 3). However, the truncated SA 
protein was found as a monomelic species (gpl70) in both fractions (lanes 5,6). This indicates 
5 that the C-terminal TM and Cys-rich domains are required for the anchorage of the S protein to 
ceU membrane. 

The cellular location of the S and S A proteins in COS? cells was analyzed by indirect 
immunofluorescence microscopy. At 48 hr post-transfection, cells were directiy fixed with 2% 
paraformaldehyde without detergent for cell surface staining or treated with detergent followed 

10 by Cytofix/Cytoperm solution for intracellular staining. Fixed cells were then stained with rabbit 
anti-S ARS sera (2BE) and FTTC-conjugated antibody. The nSh-transfected cells showed foci of 
S protein indicative of Gdlgi-localisation (Figure 85A), while the nShATC-transfected cells 
displayed a uniform distribution of S A protein throughout the cytoplasm indicative of ER 
localisation (Figvire 85B). While the complete S protein was also observed on the surface of 

15 transfected cells in unfixed cells (Figure 85D), the SA was undetectable on the cell surface 
(Figure 85E). These results indicate the role played by the TM-and Cys-rich domains in 
anchoring the S protein to tiie plasma membrane. Although the TM-regibn alone is likely 
responsible for membrane anchorage, the potential role played by the Cys-rich region remains to . 
be determined. 

10 The SARS recombinant full-length S protein is thus an N-Unked glycoprotein with an 

estimated molecular weight of 170-180,000 kDa. Deglycosylation with PNGase F resulted in a • 
polypeptide of the expected size for the uncleaved, encoded polypeptide (140kDa). Both 
transient and stable expression of the full-length SARS-CoV S gene in a variety of mammalian 
cells, including COS7, 293, BHK21, and Huh7 cell lines, consistentiy produced a S protein 

15 doublet (gp 170/1 80) as detected in western blot analyses. Pulse-chase analyses of transfected 

cells demonstrated that the SARS CoV S protein was initially synthesized as an Endo H sensitive 
gpl70 species followed by the gradual appearance of an Endo H resistant gplSO form, 
presumably as a result of the addition of complex carbohydrate within the Golgi apparatus. 
The recombinant S protein was not secreted into the cell culture medium unless the C- 

30 terminal 60 amino acids containing the TM-region and the Cys-iich tail were deleted. 

The quaternary stracture of the full-length recombinant S protein was investigated using 
cross-linking treatment, heat-denaturation, and size fractionation analyses. The results data are 
consistent with the recombinant S protein existing as a homotrimer of ~i500kDa. Similar analyses 
of virion-derived S yielded the same results. Such a trimeric structure has been reported for other 

35 enveloped RNA viruses: the hemagglutinin HA of influenza virus, the E1-E2 heterodimer of 
alphaviruses and the G protein of vesicular stomatitis vims. Incubations under reducing 
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conditions indicate that the SARS-CoV S trimeric structure is non-covalently associated, and is 
very stable. S oligomers present in the cell lysate were shown to be resistant to reduction by 10 
mM DTT, detergent treatment with 1% SDS, and heat denaturation at up to 60°C . Incubation at 
a temperature higher than >80°C resulted in the dissociation of the trimeric complex as 
evidenced by the decrease in trimer with the concomitant increase in the monomer bands. The 
temperature-induced appearance of the high-mannosylated gpl70 (ER monomer form) as well as 
the complex-glycosylated gpl80 (Golgi monomer form) suggests that trimerization can occur 
before the transport of the monomer spike protein to the medial Golgi apparatus. This is 
consistent with other reports for TGEV, influenza virus HA, and vesicular stomatitis virus G 
proteins. With these proteins, trimerization was reported to take place before addition of complex 
oligosaccharides in the Golgi apparams. 

The C-terminally truncated form of S was found in the cell lysate in both oligomeric and 
monomeric forms at a frequency of 10% and 90%, respectively. The truncated protein secreted 
into medium was found fully glysosylated and it was essentially all in monomeric form. We 
conclude that the C-terminal 60 amino acids of the S glycoprotein contains a membrane anchor ->■■ 
region that affects the efficiency of trimerization. In S protein trimerization, it is possible that the 
C-terminal region is required to initiate the event and the triple-stranded coiled coil structures in 
the S2 stalk domain provide further stabilizing force as seen in HA oligomer of influenza virus. . 

EXAMPLE 24: CHO cells for Spike protein expression 

CHO cell lines that stably express either the full-length or truncated SARS-CoV spike 
proteins were prepared. Several stably transfected CHO cell lines were obtained, and Figure 73 
shows western blot data from a panel of representative clones. 

EXAMPLE 25: Expression in Kcoli 

All SARS-CoV ORFs (Figure 17, Table 10) were cloned in the pET vector and expressed 
as C-terminal His-Tag fusion proteins in E.coli. The proteins smaller than 16KD were also 
expressed as N-terminal GST (Glutathione S-transferase) fusion proteins using pGEX vector. 

Nspl and Nsp2, the two SARS-CoV proteins with proteolytic activity, were not expressed 
as full length proteins due to toxicity in E.coli. The respective genes were instead cloned in 
different portions in order to separate; the catalytic residues (Cys833/ His994 for Nspl; 
His41/Cysl45 for Nsp2) in the resulting recombinant proteins: Nspl A from nucleotides 2719- 
5214 of AY310120; NsplB from nucleotides 5218-7371; NsplC from nucleotide 7372-9984; 
Nsp2A from nucleotide 9985-10416; Nsp2B from nucleotide 10476-10902. 

Nsp9 (SEQ ID NO: 9775) was divided into two portions: Nsp9A from nucleotide 
13371- 14756; Nsp9B from nucleotide 14757-16166. 
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Matrix (M), ORF3 and ORF7 contain respectively three, two and one transmembrane 
domains. TTiese proteins were expressed as deletion proteins excluding the first 100 amino acids 
(M and ORF3) or the first 18 amino acids (ORF7) that include the hydrophobic regions. 
The cloned sequences are shown in Table 26. 

5 A two-step strategy was used to amplify the cloned sequences. In the first step, 

amplification of DNA fragments containing more than one gene or single gene used sequenced 
cDNA as template. Eleven cDNA sequences were amplified: (1) a fi'agment, named amplCl, 
including genes coding for protein E, protein M, orf 7-8-9-10; (2) a firagment, named amplC2, 
including genes coding for orf 3-4; (3) a fragment, named amplCS, including genes coding for 

10 inroteins Nspl2 and Nspl3; (4) Nspllgene; (5) P28 and P65 genes; (6) NsplB and NsplC genes 
portion; (7) a fragment, named amplC9, including genes coding for proteins Nsp2 and Nsp3; (8) 
a fragment, named amplNsp4-7, including genes coding for proteins Nsp4, Nsp5, Nsp6, Nsp7 
and for amplification of Nsp9A gene portion; (9) Nsp 9B gene portion and NsplO gene; (10) a . 
fragment, named amplCO, including genes coding for proteins Orf 1 1, Nucleocapsid (N) and 

15 Oifl2; (11) NsplA gene portion. The primers used in this first step are given in Table 27: 

. In the second step, amplification of single genes was performed using DNA fragments 
from the first amplification step as templates. The primers are shown in Table 28. 

Of the proteins where expression was seen, it was either in inclusion bodies (insoluble) or 
in a soluble form. Purification proceeded on appropriate material. Table 29 shows the molecular 

10 weight of the expressed fragments of SARS-CoV ORFs, whether they were cloned (+ or -), 
whether the cloned fragment was seen to be expressed (+ or -) and the form of protein which 
was chosen for purification. 

Where a protein was a soluble His-tagged product, a single colony was streaked and grown 
overnight at 37°C on a LB/Amp (100 fig/ml) agar plate. An isolated colony from this plate was 

25 inoculated into 20ml of LB/Amp (100 jttg/ml) liquid medium and grown overnight at 37°C with 
shaking. The overnight cultore was diluted 1:30 into 1.0 L LB/Amp (100 /ig/ml) liquid medium 
and allowed to grow at the optimal temperature (30 or 37°C) until the OD550nm reached 0.6- 
0.8. Expression of recombinant protein was induced by addition of IPTG (final concentration 1.0 
mM) and the culture incubated for a further 3 hours. Bacteria were harvested by centrifugation at 

30 8000 X g for 15 min at 4°C. The bacterial pellet was resuspended in 10 ml of cold buffer A (300 
mM NaCl, 50 mM phosphate buffer, 10 mM imidazole, pH 8.0). Cells were disrupted by 
sonication (or French Press) on ice four times for 30 sec at 40W using a Branson sonifier 450 
and centrifuged at 13 OOOxg for 30 min at 4°C. Supematants were mixed with 150/xl Ni^^-resin 
(previously equilibrated with buffer A) and incubated at room temperature with gentle agitation 

35 for 30 min. The resin was Chelating Sepharose Fast How (Pharmacia), jprepared according to the 
manufacturer's protocol. The batch-wise preparation was centrifuged at 700 x g for 5 min at 4''C 

-324- 



BNSDOCID: <W O g 0040aZ3eOAg I > 



wo 2004/092360 PCT/US2004/011710 

and the supernatant discarded. The resin was washed twice (batch-wise) with 10ml buffer A for 
10 min, resuspended in 1.0 ml buffer A and loaded onto a disposable column. The resin 
continued to be washed with buffer A at 4°C until the OD280nm of the flow-through reached 
0.02-0.01. The resin was further washed with cold buffer B(3G0 mM NaCl, 50 mM phosphate 
buffer, 20 mM imidazole, pH 8,0) until the the OD280nm of the flow-through reached 0.02-0.01. 
The His-fusion protein was eluted by addition of 700/il of cold elution buffer C (300 mM NaCl, 
50mM phosphate buffer, 250 mM imidazole, pH 8.0) and fractions collected until the ODasomn 
indicated all the recombinant protein was obtained. 20/il aliquots of each elution fraction were 
analyzed by SDS-PAGE. Protein concentrations were estimated using the Bradford assay. 

Where . a protein was seen as an insoluble product, the inclusion bodies were purified as 
follows: homogenize ceUs (5 g wet weight) in 25 ml O.IM Tris HCl pH 7, ImM EDTA, at 4°C 
using an ultraturrax (10000 rpm); add 1 .5mg lysozyme per gram cells; mix shortly with an 
ultraturrax and incubate at 4°C for 30'; use sonication or high-pressure homogenization to 
disrupt the cells; to digest DNA, add MgCh to a final concentration of 3mM and DNase to a 
final concentration of lOug/ml and incubate 30' at 25°C. add 0.5 vol of 60mM EDTAi 6% Triton 
x-100, 1.5M NaCl pH 7.0 to the solution, and incubate for 30' at 4''C; centrifiigation at 31000 g 
for 10' at 4°C; re-suspend pellet in 40ml of 0;1M Tris HCl pH 7.0, 20 mM EDTA using 
ultraturrax; centrifugation at 31000 g for 10' a 4°C; store the IB pellet at - 20°C. 

The results of expression are shown in Figures 86 to 105. Examples of purity and yield are 
given in Table 30, 

EXAMPLE 26: Retention of critical epitope on truncated Spike antigen 

A human monoclonal antibody having neutralizing activity was tested in an ELISA assay 
for reactivity with the purified truncated Spike protein. Briefly, ELISA plates were coated with 
truncated form of the spike protein at a concentration of l/tg/ml (lOOjwy well) by incubating the 
plates ovemi^t at 4°C. The plates were washed, non-specific binding sites were blocked and 
then different dilutions of the antibody were added and plates were incubated for 1 hour at room 
temperature. At the end of incubation, the plates were washed and bound antibody was detected 
by using anti-human IgG conjugated to horse radish peroxidase (HRP) and an appropriate 
substrate. The optical density of each well was recorded at 405 imi'using an ELISA reader. The 
data are shown in Figure 69 and clearly demonstrate that the neutralizing epitope recognized by 
the mAb is preserved and exposed on the recombinant truncated Spike protein. 

EXAMPLE 27: Different Spike vaccines 

Purified truncated spike protein was used to immunize mice and the level of binding 
antibodies induced against the truncated spike protein was determined by EUSA assay. Briefly a 
group of 10 mice were immunized with 3^g of truncated spike protein adjuvanted in MF59 at 0, 
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4 and 8 weeks intervals. Sera samples were collected from these animals and assayed for 
antibodies induced by truncated spike protein in an.ELISA assay. An additional group of 8 mice 
was immunized with 75 /xg of DNA encoding the truncated form of the spike protein on PLG 
particles at 0, 4 and 13 weeks intervals, the sera were collected and analyzed as above for anti- 
spike antibodies as above 

The profile of binding antibodies induced in each group was plotted as geometric mean 
titer (GMT). Compared to a plasmid DNA vaccine expressing truncated spike antigen and 
delivered using a PLG microparticle formulation, the purified truncated spike protein was 
significantly more potent for inducing strong antibody responses. Further comparison with the 
antibody responses induced by inactivated BPL-SARS-CoV (already shown protective) in the 
same mouse strain revealed that the magnitade of antibody responses induced by purified 
truncated spike protein and the inactivated virus vaccine are in the same range (Figure 70). 

The neutralization potential of antibodies induced by the recombinant truncated spike 
protein, or plasmid DNA expressing the same spike antigen, were also evaluated. The GMT 
values obtained in both groups are shown in Figure 71. From these data; it appears that the 
purified protein is significantly more effective at inducing neutralizing antibody responses 
against the S ARS-CoV spike. Furthermore, the neutralization titers typically induced by the 
purified truncated spike protein are comparable to neutralization titers induced by an inactivated 
SARS-CoV vaccine. 

Figure 72 shows a comparison of antibody binding levels (ELISA, X-axis) with 
neutralization titers (Y-axis). In general there is a very good correlation between the binding and 
neutralizing antibodies. The bottom-left grouping shows ratios 2 weeks post-Srd immunization 
with the DNA vaccine; the top-right grouping shows ratios 2 weeks post-2nd immunization with 
the protein vaccine. Both forms of vaccine show a consistent correlation. 

In further experiments, the ability of a DNA vaccine to invoke an immune response in 
naice was smdied. Mice were immunized with pCMV-nSdTC plasmid, either free or with PLG 
microparticles. Serum from the mice was then used as the staining antibody against cultured 293 
cells that had been transfected with spike, either full-length or truncated. The cells were 
centrifuged prior to testing and the pellet was lysed. The antibody was tested against the culture 
supematant and against the cell lysate. As shown in Figure 1 12, the mouse serum was able to 
detect spike protein in the lysate of cells that expressed fuU-lengOi spike and in the supematant 
of cells that expressed the truncated spike protein. Results were comparable to the staining seen 
when using rabbit serum that had been obtained after inmiunization with whole killed virus. 
Thus anti-spike antibodies can be induced by the use of DNA vaccination. 
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EXAMPLE 28: Expression cassettes in pCMV 

The sequence of plasmid pCMVKm2 is given as SEQ ID NO: 9923. Genes encoding the 
spike protein either in full-length form (pCMVKm2 SARS Spike nS; SEQ ID NO: 9921) or in its 
ATC form (pCMVKm2 SARS Spike nSATC; SEQ ID NO: 9922) were inserted into this basic 
vector. 

Mice were immunized with these vectors, and with similar vectors encoding the N, M or E 
proteins. Vectors encoding the same proteins but with optimized codon usage were also 
prepared. Codons were optimized for efficient human expression starting from the FRA 
sequence (GenBank: AYS 10120). The optimized sequences are: N (SEQ ID NO: 9924); M (SEQ 
ID NO: 9925); E (SEQ ID NO: 9926). 

After administration, expression of proteins could be detected by immunofluorescence in 
all cases. For example. Figure 106 shows immunofluorescence (using anti-SARS rabbit serum) 
results after administration of the vector encoding optimsed N antigen, revealing high level 
expression. Mice receiving the control vector alone showed no fluorescence. 

Figure 107 compares inmiunofluorescence (using Abgent anti-M antibody) of the native M 
sequence (107A) or the codon-optimsed M sequence (107B). Similarly, Figure 108 compares 
immunofluorescence (using Abgent anti-E antibody) of the native E sequence (108A) or the 
codon-optimsed E sequence (108B). 

Four groups of mice (8 mice per group) were immunized with: (1) SARS nS Spike, nSdTG 
truncated Spike, and N proteins; (2) pCMV-SARS-nSdTC: DNA+DNA-PLG at weeks 0,4 and 
13 wks; (3) CMV-nS: DNA+DNA-PLG+VEE/SIN Rep at 0, 4 and 9 wks; (4) VEE/SIN Rep- 
SARS-nS three times at 0, 4 and 13 wks. Sera from all groups recognized SARS nS and nSdTC 
proteins, and also showed virus binding and neutralization activity. 

EXAMPLE 29: Spike protein cleavage 

To investigate the effect of proteolytic cleavage on SARS-CoV Spike protein, it was 
expressed in various forms in E.coli, including: (1) full-length S1-S2; (2) SI alone; (3) HRl 
heptad; and (4) HR2 heptad. The expressed proteins were used to raise immune rabbit sera, 
which were then used for visualizing western blots of Vero cells, either infected or not infected 
with SARS-CoV. 

Figure 109 shows a westem blot using a 1:10000 dilution of antibody raised against eitiier 
the SI domain or the uncleaved S1-S2 domains. Figure 110 shows a westem blot using a 
1 : 10000 dilution of antibody raised against each of the four proteins. The difference in antigen 
reactivity is readily apparent. 

Figure 111 shows similar data. Each serum was tested against four lanes, witii those gour 
lanes being from left to right: (a) serum at 1:500 dilution, SARS-CoV-infected cells; (b) serum at 
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1:500 dilution, non-infected cells; (c) seram at 1:2500 dilution, SARS-CoV-infected cells; (d) 
serum at 1:2500 dilution, non-infected cells. Again, the difference in antigen reactivity is readily 
apparent. 

Figures 109-111 show that the Spike protein exists in various forms in infected Vero cells, 
5 with sizes of approx. 75kDa, 90kDa, ISOkDa and >250kDa. The Spike protein is cleaved (at 
least partially) either intracellulary or after release of the particles. 

If enzymatic cleavage of the mouse hepatitis coronavirus spike protein is inhibited then 
cell-cell fusion (syncytia formation) is also inhibited, but virus-cell fusion is not (de Haan et al. 
(2004) / Virol). Syncytia are observed in vivo in the lungs of SARS-infected patients, but are not 
10 seen in Vero cell cultures of the SARS-CoV. Inhibition of Spike protein cleavage may thus be 
used to prevent syncytia formation and related pathology, even though viral infectivity may not 
be blocked. 

Example 30: Purification of SARS protease 

Cells were grown at 37°C to mid-log phase and induced with 0.2% L-arabinose. Cells were 

15 harvested by centrifugation, and the cells resiispended in lysis buffer (LB) containing 20 mM 
Tris pH 7.5, 500 mM NaCl, 5% glycerol V/V, 0.05% Triton X-100, 5 mM PME, 5 mM 
imidazole, and complete protease inhibitors (— )EDTA. Benzonase was added to a final- 
concentration of 50U/ml of lysate. Cells were then lysed using two passes through a pre-chilled 
microfiuidizer. The lysate was clarified by high speed centrifugation at 44,000 x g. Clarified 

10 lysate was applied to a prepared Pharmacia chelating FF column charged with nickel sulfate. 
After application of the lysate the coliunn was washed with 5 column volumes of LB, followed 
by 5 column volumes of LB supplemented with 45 mM imidazole. The column was then eluted 
using LB supplemented with 250 mM imidazole. Purity of the isolated SARS protease was 50%. 
Fractions containing protease were pooled, adjusted to 5 mM EDTA, and then applied to a 

25 Superdex 200 gel filtiration colunrn equilibrated in 20 mM Tris pH 7.5, 150 mM NaCl, 5% VA^ 
glycerol, 0.05 % Triton X-100, and 5 mM DTT. Purity of the isolated SARS protease was 70%. 
Again, fractions containing the protease were pooled, and then stored at -80°C until used. 
Activity assay, mass spectrometry and westem blot analysis were used to positively identify the 
protein (FIG 133). All steps were carried out with pre-chilled buffers, and kept at 4''C for as • 

50 much of the preparation as possible. 

Westem of SARS Protease Purification Fractions 

Protocol: Briefly, protein concentration was based on Absorbance at 280 nm, and coomassie 
stained gel estimates of purity. Protein was run on a 4-20% gradient gel, and transferred to 
nitrocellulose. The blot was then blocked with 3% BSA, probed with Mouse IgG anti-pentaHis, 
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and then probed with a secondary antibody to Mouse IgG conjugated with HRP. The blot was 
visualized using an ECL kit (Pharmacia Biotech). The results are shown in Figure 133 where A 
is the sizing column pool loaded at 50, 100 and 200 ng of target protein and B is the immobilized 
metal affinity column pool loaded at 50, 100 and 200 ng of target protein. 

5 

Example 31: Continuous Fluorescence Resonance Energy Transfer (FRET) Enzyme Assay 
The peptide containing EDANS, the fluorescence donor, and DABCYL, the fluorescence 
quencher (DABCYIL-VNSTLQ VSGLRKrBDANS) was synthesized by Syn. Pep. (Dublin, CA). 
The peptide contains the cleavage site Gln-Ser in the middle. Meyers, G. et al. Handbook of 

10 Proteolytic Enzymes and Barrett, A et al Academic Press, London, 1998, 726-728. The 
proteolytic activity of SARS protease was followed kinetically by measuring the level of 
formation of cleaved product that contains the fluorescence donor, SGLRK-EDANS using the 
Hitachi fluorometer (F-4500 FL Spec.) set at 340 nm excitation and 490 nm emission wave 
length. 5 of 5 mM peptide stock in DMSO solution was added to the reaction mixture, 

15 containing 295 \k\ of standard buffer (75 mM Tris-Hcl, 25 mM NaOAc, 25 mM Bis-Tris, 25 mM 
glycine, 5 mM EDTA, and 1 mM EDTA, pH 7.4) and 100 ul of buffer or 100 ul of 3.6 uM 
protease stock solution. The kinetic curve was followed for 6 minutes (the reaction was linear 
with R2 value of 0.998 (FIG 134)). The formation of fluorescence (proteolytic reaction) is likely 
enzyme dependent, as concentration of enzyme was tripled three times as much fluorescence was 

20 formed in the:6 minutes time frame. 



It will be understood that the invention has been described by way of example only and 
modifications may be made whilst remaining within the scope and spirit of the invention. 

25 
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Table 1. US Patents and Published International Patrat Applications 



Publication 
Nfumber 


Title 


Publication 
Date 


US-3927216 


1 ,2,4-Triazol E-3-Carboxamides For Inhibiting Virus Infections 


12/16/1975 


US-4010269 


Aintiviral Quinazoline Compositions And Methods Of Use 


3/1/1977 


US-4065570 


Antiviral 5-(Substituted Benzal) Hydantoins 


12/27/1977 


US-4089965 


rhiazolylphenylguanidines As Antiriiinovirus Agents 


5/16/1978 


US-4122191 


Antirhinovirus Agents 


10/24/1978 


US-4192895 


Antirhinovirus Agents 


3/11/1980- 


US-4254144 


Substituted Benzonitriles Having Antiviral Activity 


3/3/1981 


US-4264617 


Antiviral 5-(Substituted Benzal) Hydantoins 


4/28/1981 


US-4287188 


^irine Derivatives 


9/1/1981 


US-4327088 


Phosphonooxy- Or Glycosyloxy-Substituted Acrylophenones, Compositions And Uses Thereof 


4/27/1982 


US^332820 


Substituted B enzonitriles Having Antiviral Activity 


6/1/1982 


US-4349568 


Sulfur-Substituted Diphenyl Ethers Having Antiviral Activity 


9/14/1982 


US-4352792 


3-Alkoxyflavone Antiviral Agents 


10/5/1982 


aS^371537 


Sulfur-Substituted Phenoxypyridines Having Antiviral Activity 


2/1/1983 


US-4423053 


Derivatives Of 2-Amino-5-(0-Sulphamidophenyl)-l,3,4-Thiadiazol As Antiviral Agents And A 
Process For The Preparation.Thereof 


12/27/1983 


US-4505929 


Sulfur-Substituted Diphenyl Ethers Havin^Antiviral Activity 


3/19/1985 


US-4526897 


3ypertensive Isoindolin-2-Yl-Aiiiinoimidazolines And Isoindolin-2-Yl-Guanidines 


7/2/1985 


US-4558134 


Certain Phenoxy-Pyridine-Carbonitriles Having Antiviral Activity 


12/10/1985 


US-4629729 


Bndowed With Anti-Viral Activity 2-Alkylamino-4,6-Dihalo Pyrimidines 


12/16/1986 


US-4636492 


Inhibition Of Viral Protease Activity By Peptide Halomethyl Ketones 


1/13/1987 


US-4652552 


Teh-apeptide Methyl Ketone Inhibitors Of Viral Proteases 


3/24/1987 


US-4724233 


rherapeutical Application Of Phosphonylmethoxyalkyl Adenines 


2/9/1988 


US-4738984 


Antirhinovirus Agents 


4/19/1988 . 


US-4847246 


Antivu-al Compositions Derived From Fireflies And Then- Metiiods Of Use 


7/11/1989 


US-4855283 


Novel Pharmaceutically Active N-(2-Aminoacylamido-2-Deoxy-Hexosyl)-Amides, -Carbamates 
And -Ureas 


8/8/1989 


US-4885285 


Phosphorus Compounds, Processes For Their Manufacture, And Their Use 


12/5/1989. 


US-4956351 


Antiviral Pharmaceutical Compositions Containing Cyclodextrins 


9/11/1990 


aS-5001125 


Anti-Virally Active Pyridazinamines 


3/19/1991 


US-5036072 


Antiviral Agent 


7/30/1991 


US-5070090 


Antipicorpaviral Herterocyclic-Substituted Morpholinyl Alkylphenol Ethers 


12/3/1991 


US-5 100893 


Antipicomaviral Pyridazinamines 


3/31/1992 


US-5 112825 


Antirhinoviral Heteroamine-Substituted Pyridazines 


5/12/1992 


US-5 157035 


Anti-Vu^lly Active Pyridazinamines 


10/20/1992 


US-5240694 


Combined Antiviral And Antimediator Treatment Of Common Colds 


8/31/1993 


US-5242924 


retrazolyl-(Phenoxy^And Phenoxyalkyl)-Piperidinylpyridazines As Antiviral Agents 


9/7/1993 


US-5278184 


Syntiietic Derivatives Of Pyrrole And Pyrrolidine Suitable For The Therapy Of Infections Caused 
By Rhinoviruses 


1/11/1994 


US-5364865 


Phenoxy- And Phenoxyalkyl-Piperidines As Antiviral Agents 


11/15/1994 


US-5453433 


Thiadiazoles And Antipicomaviral Compositions 


9/26/1995 


US-5492689 


Combined Virustatic Antimediator (GOV AM) Treatment Of Common Colds 


2/20/1996 


US-55 14679 


Therapeutic Phenoxyalklpyridazines And Intermediates Therefor 


5/7/1996 


US-55 14692 


Substituted Quinoline Derivatives Useful As Antipiconaviral Agents 


5/7/1996 


US-5523312 


Antipicomaviral Agents 


6/4/1996 


US-5545653 


Anti-Viral Compounds 


8/13/1996 


US-5552420 


Therapeutic Phenoxyalkylazoles And Phenoxyalkylazines 


9/3/1996 


luS-5567719 


Thiadiazoles And Their Use As Antipicomaviral Agents 


10/22/1996 
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aS-5580897 


1,2-Dithiins Having Antifungal Activity 


12/3/1996 


US-5618821 


rherapeutic Phenoxyalkylheterocycles 


4/8/1997 


US-5618849 


Orally Active Antiviral Compounds 


4/8/1997 


US-5648354 


1 ,2-Dithiins Having Anti&ngal Activity 


7/15/1997 


US-5650419 


Thiadiazoles And ITieir Use As Antipicomaviral Agents 


7/22/1997 


US-5693661 


Anti- Viral Compounds 


12/2/1997 


US-5721261 


rherapeutic Phenoxyalkylazoles And Phenoxyalkylazines 


2/24/1998 


US-5725859 


Plant-Based Therapeutic Agent With Virustatic And Antiviral Effect 


3/10/1998 


lJS-5750527 


Thiadiazoles And Their Use As Antipicomaviral Agents 


'5/197100R 


US-5750551 


Treatment For Viral Diseases 


*>/19/1 OOR 


US-5762940 


Methods And Compositions For Inhibiting Or Destroying Viruses Or Retroviruses 


finn OOR 

xyyo 


US-5763461 


Therapeutic Phenoxyalkylheterocycles 


fi/Q/1 OOR 


LFS-5821242 


Anti-Viral Compounds 


in/1'^/IQQR 


US-5821257 


Thiadiazoles And Their Uses As Antipicomaviral Agents 


1 0/1 VI OOR 


US-5821331 


Anti-Picomaviral Agents 


10/1*^/1 QOR 
IV// Ijl lyyo 


QS-5846986 


Therapeutic Phenoxyalkylazoles And Phenoxyalkylazines 


197R/100R 


US-5856530 


Antipicomaviral Compounds And Methods For Their Use And Preparation 


1/5/1999 


US-5891874 


Anti- Vu-al Compound 


4/6/1999 • 


US-5962487 


Antipicomaviral Compounds And Methods For Their Use And Preparation 


10/5/1999 


US-6004933 


Cysteine Protease Inhibitors 


x^^Xi Vyyy 


US-6020371 


Antipicomaviral Compounds Compositions Containing Them And Methods For Their Use 


mnms 


Ui>-oUo7374 


Anti- Viral Compounds 


7/11/2000 


US-61 14327 


Anti^Vural Compounds 


o/s/9nnn 


US-61 17844 


Method And Composition For Antiviral Therapy 


yl IZ/ZUUU 


US-61 94447 


Bis (Benzimidazole) Derivatives Serving As Potassium Blocking Agents 


7/97/9001' 


US-6214799 
US-6277891 


Antipicomaviral Compounds And Methods For Their Use And Preparation 
Nitric Oxide Inhibits Rhinoviras Infection 


d/1 n/9nni 


US-6294186 


Antimicrobial Compositions Comprising A Benzoic Acid Analog And A Metal Salt 


9/25/2001 


US-6331554 


Antipicomaviral Compounds, Compositions Containing Them And Methods For Hieir Use 


1 9/1 R/9nni ■ 


US-6358971 


Anti- Viral Compounds 


^/1 0/9 009 


US-6362166 


Antipicomaviral Compounds And Methods For Their Use And Preparation 




US-6414004 


3-Substihited 5-Aryl-4-Isoxazolecarboniti-iles Having Antiviral Activity 




US-6420591 


Carbamates And Compositions Thereof, And Methods For Their Use For Treating Cancer, 
Inflammation, Or A Viral Infection 


7/1 <;/'>fifv>- 


LrS-6469018 


Compounds 


10/22/2002 


US-6498178 


Inhibitors Of IMPDH Enzyme 


lZ/Z4/zUUz 


US-6514997 


Antipicomaviral Compounds And Compositions, Their Pharmaceutical Uses, And Materials For 
Their Synthesis 


9/4/900'? 


US-6525043 


Use Of Ion Channel Modulating Agents 


2/25^003 


US-6531452 


Antipicomaviral Compounds And Compositions, Their Pharmaceutical Uses, And Materials For 
Their Synthesis 


v^ 1 /9001 

Jl 1 l/ZUU J 


US-6534489 


Organophosphoms Compounds And The Use Theiwf 


v^ R/900"^ 

Jl X OIJtVK/J 


WOOO/06529 


Diketoacid-Derivatives As Inhibitors Of Polymerases 


2/10/2000 


WUUU/Z5 /yi 


Pyridin-4-Yl Or Pyrimidin-4-Yl Substituted Pyrazines 


5/11/2000 


WOOO/27423 


Metiiods And Compositions For Treating Common Cold Symptoms 


5/18/2000 


WOOO/34308 


Protein Transduction System And Methods Of Use Thereof 


fi/l ^/90on 


WOOO/39348 


Metiiods And Compositions For Identifying Protease Modulators 


7/6/2000 


WOOO/40243 
WOOO/50037 


*^ovel Compounds 


7/13/2000 


WOOO/56331 


^ih-osated And Nitrosylated Proton Pump Inhibitors, Compositions And Methods Of Use 
nhibitors Of Impdh Enzyme 


J/31/2000 
?/28/2000 


WOOO/56757 


mmunomodulatory Steroids, In Particular The Hemihydrate Of 16.AIpha.-Bromoepiandrosterone 


)/28/2000 


WOOO/66096 


hJew Indication For Use Of Antiepileptic Agents And Medicines 


1/9/2000 


WOOO/78746 


Antivual Agents 


2/28/2000 
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WOOl/00199 


Compounds Obtained From Salvia Species Having Antiviral Activity 


1/4/2001 


WOOl/00585 


Pyrazolidinol Compounds 


1/4/2001 


WOOl/02551 


Virus Like Particles, Their Preparation And Their Use Preferably In Pharmaceutical Screening 
And Functional Genomics 


1/11/2001 


WOOl/03681 


Use Of Flavones, Coumarins And Related Compounds To Treat Infections 


1/18/2001 


WOOl/05396 


Use Of Cobalt Chelates For Treating Or Preventing Virus Infection 


1/25/2001 


WOOl/10894 


Antipicomaviral Compounds And Con^sitions, Their Pharmaceutical Uses, And Materials For 
rheir Synthesis 


2/15/2001 


WOOl/19322 


Use Of Csaids In Rhinovirus Infection 


3/22/2001 


WOOl/19822 


Antiviral Agents 


3/22/2001 


WOOl/22920 


Colon And Colon Cancer Associated Polynucleotides And Polypeptides 


4/5/2001 


WOO 1/25188 


Novel Carbamates And Ureas 


4/12/2001 


WOOl/31016 


Processed Human Chemoldnes Pho-1 And Phc-2 


5/3/2001 


WOOl/37837 


3,4-Dihydro-(lh)-Qumazolin-2-Ones And Their Use As Csbp/P38 Kinase Inhibitors 


5/3I/200I 


WOOl/38312 


3,4-Dihydro-(l h)QuinazoIin-2-One Compounds As Csbp/P38 Kinase Inhibitors ' 


5/31/2001 


WOOl/38313 


3,4-Dihydro-(lh)Quinazolni-2-One Compounds As Csbp/P39 Kinase Inhibitors . 


5/31/2001 


WOOl/38314 


3,4-Djhydro-(lh)Qumazolin-2-One Compounds As Csbp/P38 Kinase Inhibitors 


p/31/2001 


WOOl/40189 


Antipicomaviral Compounds And Compositions, Thek Pharmaceutical Uses, And Mataials For 
rheir Synthesis 


6/7/2001. 


WOOI/49303 


Multivalent Electron Active Compositions And Methods Of Making And Using Same 


7/12/2001 ■ 


WOOl/60393 


Selective Destruction Of Cells Infected With Human Immunodeficiency Virus 


8/23/2001 


WOOl/62726 


2-Oxo-l-PyrroIidine Derivatives, Processes For Preparing Tbem And Their Uses 


8/30/2001 


WOOl/79167 


Antipicomaviral Conqwunds And Compositions, Their Phannaceutical Uses, And Materials For 

rheir Synthesis 


10/25/2001 


WOOl/90047 


Novel Mmp-2/Mmp-9 Inhibitors 


11/29/2001 


WOOl/90129 


Prophylactic And Therapeutic Treatment Of Infectious And Other Diseases With Mono- And 

Disaccharide-Based Compounds 


11/29/2001 


WOOl/92499 


Nucleic Acid Molecules Encoding A Protein Interacting With Ser/Thr Kinase Akt 


12/6/2001 


WOOl/93883 


rherapeutic Agents - lii 


12/13/2001 


WOOl/93884 


rherapeutic Agents - 1 


12/13/2001 


WOOl/93885 


rherapeutic Agents - li 


12/13/2001 


WOOl/96297 


Antipicomaviral Compounds And Compositions, Their Pharmaceutical Uses. And Materials For 

Fheir Synthesis 


12/20/2001 


WO02/04413 


Chiral Integrin Modulators And Methods Of Use Thereof 


1/17/2002 


WO02/11743 


Treatment Of Prostate Cancer 


2/14/2002 


WO02/12477 


Enhanced Replication Of Hcv Rna 


2/14/2002 


WO02/14343 


Immunosuppressive, Antiinflammatory And Analgetic Compounds 


2/21/2002 


WO02/24145 


Antiviral Substances From Plant Cuticular And Epicuticular Material 


3/28/2002 


WO02/28351 


[Recombinant Mucin Binding Proteins From Steptococcus Pneumoniae 


4/11/2002 


WO02/30442 


[Method For Treating Cytokine Mediated Hepatic Injury 


4/18/2002 


WO02/34771 


Nucleic Acids And Proteins From Streptococcus Groups A & B 


5/2/2002 


WO02/44737 


Diagnostic And Therapeutic Compositions And Methods Related To G Protein-Coupled Receptor 
(Gpcr) Anaphylatoxin C3a Receptor 


6/6/2002 


WO02/50045 


Antiviral Agents 


6/27/2002 


WO02/51413 


Macrocyclic Anti- Viral Compounds 


7/4/2002 


WO02/53138 


Treatment For Inhibiting Neoplastic Lesions 


7/11/2002 


WO02/57425 


Nucleoside Derivatives As Inhibitors Of Rna-Dependent Rna Viral Polymerase 


7/25/2002 


WO02/59083 


Novel Compounds 


8/1/2002 


WO02/60875 


Nicotinamide Biaryl Derivatives Useful As Inhibitors Of Pde4 Isozymes 


8/8/2002 


WO02/60898 


Thiazolyl-, Oxazolyl-, Pyrrolyl-, And Imidazolyl-Acid Amide Derivatives Useful As Inhibitors 
Of Pde4 Isozymes 


8/8/2002 


WO02/69903 


Nucleosides, Preparation Thereof And Use As Inhibitors Of Rna Viral Polymerases 


9/12/2002 


WO02/72022 


Substituted Tetracycline Compounds As Antifungal Agents 


9/19/2002 
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WO02/72031 


Substituted Tetracycline Compounds As Synergistic Antifungal Agents 


9/19/2002 


WO02/76939 


Cysteine Protease Inhibitors 


10/3/2002 


WO02/77021 


Streptococcus Pneumoniae Proteins And Nucleic Acids 


10/3/2002 


WO02/79401 


Novel Rgs9 Protein Bmdmg Interactions And Methods Of Use Thereof 


10/10/2002 


WO02/82041 


Production And Use Of Novel Peptide-Based Agents For Use With Bi-Specific Antibodies 


10/17/7009 


WO02/87465 


Compositions And Methods Of Double-Targeting Virus Infections And Cancer Cells 


11/7/2007 


WO02/87500 


Viral Enzyme Activated Prototoxophores And Use Of Same To Treat Viral Infections 


11/7/2002 


WO02/88091 


Inhibitors Of Human Rhinovinis 2a Cysteine Protease 


1 1/7/9009 


WO02/89832 


Pharmaceutical Compositions For Preventing Or Treating Thl And Th2 Cell Related Diseases By 
Modulating The Thl/Th2 Ratio. 


1 1 /1 4/9009 


WO02/92779 


Method For Enriching Tissues In Long Chain Polyunsaturated Fatty Acids 


11/21/2002 


WO02/94185 


Conjugates And Compositions For Cellular Delivery 


11/98/5009 


WO02/94868 


Staphylococcus Aureus Proteins And Nucleic Acids 




WO02/96867 


Inhibitors Of Protein Kinase For The Treatment Of Disease 


19/';/9009 


WO02/98424 


Novel Anti-Infectives 


12/12/2002 


WO03/04489 


Compositions And Methods For Inhibiting Prenyltransferases 


1 /I Knnni 
l/lo/ZUUo 


WO03/08628 


Enzymatic Nucleic Acid Peptide Conjugates .r " 


1/30/2003. 


WO03/15744 


Chitin Microparticles And Their Medical Uses 


IijL III\JM5 . 


WO03/20222 


Dioxolane And Oxathiolane Denvatives As Inhibitors Of Rna-Dependent Rna Viral Polymerase 


3/13/2003 


WO03/20270 


Oxadiazolyl-Phenoxyalkylisoxazoles, Compositions Thereof And Methods For Their Use As 
Anti-Picomaviral Agents ' 


illillWi 


WO03/20271 


Oxadiazolyl-Phenoxyalkylisoxazoles, Compositions Thereof And Methods For Their Use As 
Anti-Picomaviral Agents 




WO03/20712 


Oxadiazolyl-Phenoxyalkylthiadiazoles, Compositions Thereof And Methods For Their Use As; 
Anti-Picomaviral Agents 




WO86/03412 


Improvements Relating To The Treahnent Control And Prevention Of Rhinovinis Infections 


an 0/1 QRf\ 


WO86/03971 


Antiviral Agents 


7/17/1986 


WO88/09669 


Avirulent Microbes And Uses Therefor 


12/15/1988 


WO92/03475 


Enterovirus Peptides 


3/5/1992 


WO92/22520 


Orally Active Antiviral Compounds 


19/9"^/! 009 


WO92/22570 


Inhibitors Of Picomavirus Proteases 


1 9/9 "^/l 009 


WO94/00012 


Wucleic Acids And Methods Of Use Thereof For Conb-olling Viral Patiiogens 




WO95/03821 


Prosaposin And Cytokine-Derived Peptides As Therapeutic Agents 


i/Q/1 one 


WO95/09175 


Ring-Expanded Nucleosides And Nucleotides . 


At/mooK . 
f/o/iyyj -. 


W095/11992 


Antiviral Compounds 


5/4/1995 


W095/31198 


rhiadiazoles And Their Use As Antipicomaviral Agents 


11/23/1995 


W095/31438 


Therapeutic Phenoxyalkylheterocycles 




W095/31439 


rherapeutic Phenoxyalkylpyridazines And Intermediates Therefor 


11/9'^/IQQ^ 
I If £.01 lyyj 


W095/31452 


rherapeutic Phenoxyalkylazoles AndPhenoxyalkylazines 


11/23/1995 


W095/34595 


Antiviral Dendrimers 


9/91/1 Q0< 


WO95/35103 


\ Pharmaceutical Composition For The Prevention And/Or Treatment Of Viral Infections And 
Dptionally Inflammations As Well As A Metiiod For The Treatment Thereof 


9/9 R/1 COS 
IZiioi lyyj 


WO96/05836 


Methods Of Treating Cold Symptoms Using Pentoxifylline 


?/90/1 QQfi 
u £,yj lyyo 


WO96/05854 


combination Preparation, Containing Cyclosporin A Or Fk506 Or Rapamycin And A Xanthine 
Derivative 


>/90/1 OOfi 


WO96/09822 


\ntipicomaviral Agents 


1/4/1 OOfi 


W096/11211 


Selective Inhibition Of Litemally Initiated Rna Translation 


/I S/100(^ 
H lol lyyO 


W096/22689 


Vlultiple Component Rna Catalysts And Uses Thereof 


5/1/1996 


WO96/40641 


Sulfonamide Denvatives As Cell Adhesion Modulators 


2/19/1996 


WO97/08553 


rargeting Of Protems To The Cell Wall Of Gram-Positive Bacteria i 


mm 


W097/34566 I 


ilecbx)philic Ketones For The Treatment Of Herpesvinis Infections c 


/25/1997 


W097/41137 [ 


Jse Of Anthocyanidin And Anthocyanidin Derivatives ] 


1/6/1997 


WO97/43305 I 


nhibitors Of Picomavinis 3c Proteases And Methods For Their Use And Preparation 1 


1/20/1997 
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WO97/47270 


Novel Anti-Viral Compounds 


12/18/1997 


WO98/03572 


Antiviral Linear Polymers 


1/29/1998 


WO98/07745 


Compositions And Methods For Treating Infections Using Analogues Of Indolicidin 


2/26/1998 


W098/11778 


Antimicrobial Treatment For Herpes Simplex Virus And Other Infectious Diseases 


3/26/1998 


W098/22495 


Antikinin Compounds And Uses Thereof 


5/28/1998 


W098/31363 


Anti- Viral Compounds 


7/23/1998 


W098/31374 


Method Of Treating Rhinoviral Infections 


7/23/1998 


W098/32427 


Therapeutic Treatment And Prevention Of Infections With A Bioactive Material Encapsulated 
Within A Biodegradable-Biocompatible Polymeric Matrix 


7/30/1998 


WO98/34601 


Method For Inhibiting Intracellular Viral Replication 


8/13/1998 


W098/42188 


Antimicrobial Prevention And Treatment Of Human Immunedeficiency Virus And Other 
Infectious Diseases 


10/1/1998 


WO98/43950 


Antipicomaviral Corapouds, Compositions Containmg Them, And Methods For Their Use 


10/8/1998 


WO98/49190 


Substituted Oxadiazole Cysteine Protease Inhibitors 


11/5/1998 


WO98/55120 


Anti-Viral Compounds 


12/10/1998 • 


WO99/30699 


Modulators Of Cysteine Protease 


6/24/1999 


W099/31122 


Antipicomaviral Compounds And Methods For Their Use And Preparation 


6/24/1999 


Vvuyy/j4ji / 


Cysteine Protease Inhibitors 


10/28/1999 


W099/55663 


Inhibitors Of Impdh Enzyme 


11/4/1999 


W099/57135 


Antipicomaviral Compounds, Their Preparation And Use 


11/11/1999 


W099/59587 


Anti- Viral Compounds 


11/25/1999 


W099/61437 


Novel 2-Alkyl Substituted Imidazole Compounds 


12/2/1999 



Table 2 . US Patents and Published International Patent Applications 



Publication 
Number 


Title 


Publicatio 
nDate 


WO02/69903 


Nucleosides, Preparation Thereof And Use As Inhibitors Of Rna Viral Polymerases 


9/12/2002 


WO02/48116 


Inhibitors Of Hepatitis C Virus Ns3 Protease 


6/20/2002 


WO02/48157 


tnridazolidinones And Their Related Derivatives As Hepatitis C Virus Ns3 Protease Inhibitors 


6/20/2002 


WO02/61048 


In Vitro System For Replication Of Rna-Dependent Rna Polymerase (Rdrp) Viruses 


8/8/2002 


WO03/02518 


Novel 2,4-Difluorobenzamide Derivatives As Antiviral Agents 


1/9/2003 


WO02/79187 


Methoxy-l,3,5-Triazine Derivatives As Antiviral Agents 


10/10/2002 


WOOl/78648 


6-Methylnicotinamide Derivatives As Antiviral Agents 


10/25/2001 


WOOl/12214 


MYCOPHENOLATE MOFETIL IN ASSOCIATION WITH PEG-IFN-.Alpha. 


2/22/2001 


WO02/100415 


^'-Substituted Nucleosides . 


12/19/2002 


WO02/18404 


Nucleoside Derivatives 


3/7/2002 


WO02/94289 


Antiviral Nucleoside Derivatives 


11/28/2002 


WO96/39500 


Oligonucleotides Specific For Hepatids C Viras 


12/12/1996 


WO03/00713 


Nucleoside Compounds In Hcv 


1/3/2003 


WOOl/60381 


Nucleoside Analogs With Carboxamidine-Modified Bicyclic Base 


8/23/2001 


WO02/03997 


Pyndo[2,3-DlPyrimidine And Pvrimidof4,5-D]Pyrimidine Nucleosides 


1/17/2002 


W097/26883 


Modulation Of Thl/rh2 Cytokine Expression By Ribavirin3 And Ribavirins Analogs In 
Activated T-Lymphocytes 


7/31/1997 


WO03/26589 


Methods And Compositions For Treating Hepatitis C Viras Using 4'-Modified Nucleosides 


4/3/2003 


WO03/26675 


Methods And Compositions For Treating Flaviviruses And Pestiviruses Using 4'-Modified 
Nucleoside 


4/3/2003 


WO97/30067 


Sugar-Modified Gapped Oligonucleotides 


8/21/1997 


WOOl/47883 


Fused-Ring Compounds And Use Thereof As Drugs 


7/5/2001 


WO03/00254 


Fused Cyclic Compounds And Medicinal Use Thereof 


1/3/2003 


WO02/100354 


Pyrrolo[2,3-D}Pyrimidine Nucleoside Analogs 


12/19/2002 


WOOl/55111 


Biaryl Compounds, Their Preparation And Their Use In Therapy 


8/2/2001 
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WOO 1/16149 


2-Azapurine Compounds And Their Use 


3/8/2001 


WOOl/85770 


Sentinel Vims li 


11/15/2001 


WO02/12263 


Nucleic Acid Bindmg Compounds Containing Pyrazolo[3,4-D]Pyriniidine Analogues Of Purin 
2,6-Diamine And Their Uses 


- 2/14/2002 


IP 2001-247550 A2 


Condensed Ring Compound And Its Medicinal Use 


9/11/2001 


6210675 


PT-NANB Hepatitis Polypeptides 


4/3/2001 


6451991 


Sugar-Modified Gapped Oligonucleotides 


9/17/2002 


5830455 


Method Of Treatment Usmg A Therapeutic Combination Of a-Interferon And Free Radical 
Scavengers 


11/3/1998 


5908621 


Polyethylene Glycol Modified Interferon Therapy 


D/i/iyyy 


5990276 


Synthetic Inhibitors Of Hepatitis C Virus NS3 Protease 


11/23/1999 


6172046 


Combination Therapy For Eradicatmg Detectable HCV-RNA In Patients Having Chronic 
Hepatitis C Infection 


1/9/2001 


6177074 
6326137 


Polyethylene Glycol Modified Interferon Therapy 
Hepatitis C Virus Protease-Dependent Chimeric Pestivirus 


1/23/2001 


6434489 
5461605 


Compositions Of Hepatitis C Viras NS5B Polymerase And Methods For CrvstalliziuE Same 
Continuous Low-Dose Cytokine Infusion Therapy 


12/4/2001 
8/13/2002 
10/8/2002 


6472373 


Combmation Therapy For Eradicating Detectable HCV-RNA In Antiviral Treatment Naive 
Patients Having Chronic Hepatitis C Infection 


10/29/2002 


DDZ4D /O 


Polyethylene Glycol Modified Interferon Therapy - ; 


2/25/2003 


WOOO/37097 


Ribavirin-Interferon Alfa Induction Hcv Combination Therapy 


6/29/2000 


WOOO/37110 


Wbavu-in-Pegylated Interferon Alfa Induction Hcv Combination Therapy 


6/29/200Q 


WOOO/62799 


Hcv Combination Therapy. Containmg Ribavirin In Association With Antioxidants 


10/26/2000 


WOO 1/58929 


Azapeptides Usefiil In The Treatment Of Heoatitis C 


8/16/2001 


WO02/32414 


Ribavmn-Pegylated Interferon Alfa Hcv Combination Therapy 


4/25/2002 


WO03/24461 


Hcv Combination Therapy 


3/27^003 


WO93/20835 


Treatinent Of Hepatitis With Gm-Csf 


10/28/1993 


WO96/36702 


Soluble, Active Hepatitis C Virus Protease 


11/21/1996 


WO97/16204 


Continuous Low-Dose Cytokine Infusion Therapy 


5/9/1997 


WO97/43310 
WO98/48840 


Syntiietic Inhibitors Of Hepatitis C Virus Ns3 Protease 

Polyethylene Glycol-Inteiferon Alpha Coniusates For Therapy Of Infection 


11/20/1997 


W099/15194 


Combmation Therapy For Eradicating Detectable Hcv-Rna In Patients Having Chronic ' 
Hepatitis C Infection 


11/5/1998 
4/1/1999 


W099/59621 

WO02/100846 
WO02/100851 


Combmation Therapy Comprising Ribavirin And Interferon Alpha In Antiviral TYeatment 
Naive Patients Having G Chronic Hepatitis C Infection 

Compounds And Metiiods For The Treatinent Or Prevention Of Haviviras Infections 
Compounds And Methods For The Treatinent Or Prevention Of Havivirus Infections 


11/25/1999 

12/19/2002 
12/19/2002 


5241053 
5556946 
6087484 

5830905 
6316492 


Kuscd Proteins ConiDrisih? Glvconrntwn oh nf ttqv i AvxAi td 
[nterleukin-2Aru:al Antigen Protein Chimers 

Enhancement Of Ribozyme Catalytic Activity By A 2'-0-Substiftited Facilitator 
Dligonucleotide 

compounds, Compositions And Metiiods For Treatinent Of Hepatitis C 


S/3 1/1993 
J/17/1996 
7/11/2000 

1/3/1998 


6440985 
WOOO/10573 


Methods For Treating Or Preventing Viral Infections And Associated Diseases 
Methods For Treating Viral Infections 

-ompounds. Compositions And Metiiods For Treating Or Preventing Viral Infections And 
\ssociated Diseases 


1/13/2001 
5/27/2002 
/2/2000 


WOOO/13708 P 
WOOO/18231 P 
W099/51781 f 
6323180 I 


Methods For Treating Or Preventing Viral Infections And Associated Diseases ^ 
i^ethods For Treating Or Preventing Viral Infections And Associated Diseases i 
lepatitis C Virus Ns5b Compositions And Methods Of Use Thereof i 
lepatitis C Inhibitor Tri-Peptides ~" ^ 


/1 6/2000 
/6/2000 
0/14/1999 
1/27/2001 


6143715 I 
6329379 [ 
6329417 F 


lepatitis C Inhibitor Peptide Analogues " ] 
[epatitisC Inhibitor Tri-Peptides , 
[epatihsC Inhibitor Tri-Pepitides [ 
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6410531 


Hepatitis CInhibitor Tri-Peptides 


6/25/2002 


6420380 


Hepatitis C Inhibitor Tri-Peptides 


7/16/2002 


6448281 


Viral Polymerase Inhibitors 


9/10/2002 


6479508 


Vu^l Polymerase Inhibitors 


11/12/2002 


6534523 


Hepatitis C Inhibitor Tri-Peptides 


3/18/2003 


WOOO/09543 


Hepatitis C Inhibitor Tri-Peptides 


2/24/2000 


WOOO/09558 


Hepatitis C Inhibitor Peptides 


2/24/2000 


WO00y59929 


Macrocyclic Peptides Active Against Hie Hepatitis C Virus 


10/12/2000 


WO02/04425 


Viral Polymerase Inhibitors 


1/17/2002 


WO02/70739 


Hcv Polymerase Inhibitor Assay 


9/12/2002 


WO03/07945 


Viral Polymerase Inhibitors 


1/30/2003 


WO03/10140 


Viral Polymerase Inhibitors 


2/6/2003 


WO03/10141 


Viral Polymerase Inhibitors 


2/6/2003 


WO99/07734 


Hepatitis C Inhibitor Peptide Analogues 


2/18/1999 


WOOl/16379 


hepatitis C Virus Replication Inhibitors 


3/8/2001 


WO02/07761 


Inhibiting Hepatitis C Virus Processing And Replication 


1/31/2002 


WO02/57287 


Nucleoside Derivatives As Inhibitors Of Rna-Dependent Rna Viral Polymerase 


7/25/2002 


WO02/57425 


Nucleoside Derivatives As Inhibitors Of Rna-Dependent Rna Viral Polymerase 


7/25/2002 


WO02/70651 


Viral Reporter Particles 


mmm 


WO03/20222 
PCT/US2003/ 
041493 


Dioxolane And Oxathiolane Derivatives As Inhibitors Of Rna-Dependent Rna Viral Polymerase 
rhiosemicarbazones as Anti-Vinds and Immunopotentiators 


3/13/2003 
01/10/2003 



Table 3 : US Patents and published international patent applications relating to inhalation 



technology for the delivery of antiviral compounds of the invention. 



Publication 
Number 


Title 


E*ubIication 
Date 


5740794 


Apparatus and methods for dispersing dry powder medicaments 


4/21/1998 


5775320 


Method and device for delivering aerosolized medicaments 


7/7/1998 


5785049 


Method and apparatus for dispersion of dry powder medicaments 


7/28/1998; 


5814607 


Pulmonary delivery of active fragments of parathyroid hormone 


9/29/1998 


5826633 


Powder filling systems, apparatus and methods 


10/27/1998 


5458135 


Method and device for delivering aerosolized medicaments 


10/17/1995 


5607915 


Pulmonary delivery of active fragments of parathyroid hormone 


3/4/1997 


5654007 


Methods and system for processing dispersible fine powders 


8/5/1997 


5922354 


Methods and system for processing dispersible fine powders 


7/13/1999 


5928469 


Process for storage of materials 


7/27/1999 


5976574 


Processes for spray drying hydrophobic drugs in organic solvent suspensions 


11/2/1999 


5985248 


JProcesses for spray drying solutions of hydrophobic drugs and compositions thereof 


11/16/1999 


5994314 


Compositions and meUiods for nucleic acid delivery to the lung 


11/30/1999 


5997848 


Methods and compositions for pulmonary delivery of insulin 


12/7/1999 


6001336 


Processes for spray drying aqueous suspensions of hydrophobic drugs and compositions thereof 


12/14/1999 


6019968 


Dispersible antibody compositions and meOiods for their preparation and use 


2/1/2000 


6051256 


Dispersible raacromolecule compositions and methods for theur preparation and use 


4/18/2000 


6071428 


Stable compositions 


6/6/2000 


6077543 


Systems and processes for spray drying hydrophobic drags with hydrophilic exdpients 


6/20/2000 


6080721 


Pulmonary delivery of active fragments of parathyroid hormone 


6/27/2000 


6089228 


Apparatus and methods for dispersing dry powder medicaments 


7/18/2000 


6103270 


Methods and system for processing dispersible fine powders 


8/15/2000 


6123936 


Methods and compositions for the dry powder formulation of interferons 


9/26/2000 
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6136346 


Powdered pharmaceutical formulations having improved dispersibility 


10/24^000 


6138668 


Method and device for delivering aerosolized medicaments 




6165463 


Dispersible antibody compositions and methods for their preparation and use 


li/ZO/ZUUU 


6182712 


Power filling apparatus and methods for their use 


ao/Zvul 


6187344 


Powdered pharmaceutical formulations having improved dispersibility 


1115/lUUl 


6207135 


Gaseous microparticles for ultrasonic diagnosis and process for their production 


3/27/2001 


6231851 


Methods and compositions for the dry powder formulation of interferons 


5/15/2001 


6257233 


Dry powder dispersing apparatus and methods for their use 


//iU/2001 


6258341 


Stable glassy state powder formulations 


7/10/2001 


6267155 


Powder jRlling systems, apparatus and methods 


7/31/2001 . 


6294204 


Method of producing morphologically uniform microcapsules and microcapsules produced bv 
this method ^ 


9/25/2001 


6303582 


Compositions and methods for nucleic acid delivery to the lung 




10/16/2001 


6309623 


Stabilized preparations for use in metered dose inhalers 


10/30^001 


6309671 


Stable glassy state powder formulations 


10/30/2001 


6358530 


Powdered pharmaceutical formulations having improved dispersibility 


3/19/2002 


6365190 


Systems and processes for sorav drvine hvdronhobic Hni9« with hvHmnViiiir> e^vr-ir^i^^*^ 


4/2/2002 


6372258 


Methods of spray-drying a drug and a hydrophobic amino acid 


4/16/2002 


6423344 


Dispersible macromolecule compositions and methods for their preparation and use 


7/23/2002 


5426210 


Storage of materials 


7/30/2002 


6433040 


Stabilized bioactive preparations and methods of use 


8/13^002 


6440337 


Method and apparatus for the formation of particles 


8/27/2002 


EUE37872 


Storage of materials 


10/8/2002 


6479049 


Methods and compositions for the dry powder formulation of interferons 


11/12/2002 


6503411 


Stable compositions 


1/7/2003 


6509006 


Devices compositions and methods for the nulmonarv dplivmv nf nfmcnU-rpA moM^^^^^tr 


1/21/2003 


6514496 


Dispersible antibody compositions and methods for their preparation and use 


2/4/2003 


5518239 


iry powder compositions having improved dispersivity 


2/11/2003. 


6543448 


apparatus and methods for dispersing dry powder medicaments 


*/o/ZUU3 


6546929 


bry powder dispersing apparatus and methods for their use 


4/15/2003 


WO 00/15262 


dry powder active agent pulmonary delivery 


3/23/2000 


WO 93/00951 


method and device for delivering aerosolized medicaments 


1/21/1993 


WO 94/07514 


julraonary delivery of active fragments of parathyroid hormone 


4/14/1994 


WO 95/24183 


methods and compositions for pulmonary delivery of insulin 


9/14/1995 


WO 95/31479 


methods and compositions for the dry powder formulation of interferons 


11/23/1995 


WO 96/09085 


apparatus and methods for dispersing dry powder medicaments 


3/28/1996 


WO 96/32096 


)owdered pharmaceutical formulations having improved dispersibility 


10/17/1996 


WO 96/321 16 


impositions and methods for nucleic acid delivery to the lung 


10/17/1996 


WO 96/32149 


julmonary delivery of aerosolized medicaments 


10/17/1996 


WO 96/32152 


)ulmonary administration of dry powder alpha 1-antitrypsm 


0/17/1996 


WO 96/40068 


methods and system for processing dispersible fine powders 


12/19/1996 


WO 97/41031 


Dowder filling systems, apparatus and methods 


1/6/1997 


WO 97/41833 


iispersible macromolecule compositions and methods for their preparation and use 


11/13/1997 


WO 98/16205 


table glassy state powder formulations 


J/23/1998 


WO 98/29096 


lerosolized hydrophobic dmg 


7/9/1998 


WO 98/29098 


. . o^iajr ui^fiiig aqucuua ^lU^)pc^sIons or nyoropnooic drugs with hydrophilic 
xcipients and compositions prepared by such processes 


V9/1998 


WO 98/29140 [ 

c 


)rocesses and compositions for spray drying hydrophobic drugs in organic solvent suspensions 
f hydrophilic excipients 


/9/1998 


WO 98/29141 f 

c 


rocesses for spray drymg solutions of hydrophobic drugs with hydrophilic excipients and 
ompositions prepared by such processes 


/9/1998 


WO 99/19215 D 


owder filling apparatus and method ^1 


/22/1999 


WO 99/42124 I 


quid crystal forms of cyclosporin g 


/26/1999 
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WO 99/47196 


aerosolized active agent delivery 




WO 99/62495 


dry powder dispersing apparatus and methods for their use 


10/Q/1OQQ 


WO 00/21594 


Elovy resistance modulated aerosolized active agent delivery 


't/ZU/ZUUl/ 


WO 00/61178 


pulmonary administration of dry powder formulations for treating infertility 


in/iQ/onnn 

iu/iy/zuuu 


WO 00/72904 


apparatus and method for dispensing metered amount of aerosolized medication 


\innf\nn 


WO 01/00263 


systems and methods for aerosolizing phannaceutical formulations 




WO 01/00312 


spray drying process for preparing dry powders 


1 M/onni 

l/4/zUUl 


WO 01/32144 


iry powder compositions having improved dispersivity 


s/in/9nni 


WO 01/43529 


receptacles to facilitate the extraction of powders 


am nf\ni 
o/Zi/ZUUl 


WO 01/43530 


systems and mediods for extracting powders from receptacles 


6/21/2001 


WO 01/43802 


systems and methods for treating packaged powders 




WO 01/44764 


systems and methods for non-destructive mass sensing 


E>/Zl/zUUI 


WO 01/87393 


systems, devices and methods for opening receptacles having a powder to be fluidized 


11/22/2001 


WO 01/93932 


oclcout mcchflnism for s^rosol dm^ delivery devices 


iZ/li/2UUl 


WO 02/09669 


apparams and process to produce particles having a narrow size distribution and particles made 
hereby 


2/7/2002 




nhaleable spray dried 4-helix bundle protein powders having mimmized aggregation 




2/14/2002 


WO 02/49619 


nduced phase transition method for the production of microparticles containing hydropfailic 


Oil ifZUliZ 




inauceQ pnase transition method tor the production of microparticles containing hydrophobic 

active agents 


5/27/2002 


WO 02/54868 


puhnonary delivery of polyene antifungal agents 


7/18/2002 


WO 02/87542 


novel methods and compositions for delivering macromolecules to or via the respiratory tract 


11/7/2002 


WO 02/100548 


centrifuged rotating drum for treating cohesive powders ■■ 


12/19/2002 


WO 03/00326 


powder aerosolization apparatus and method 


1/3/2003 


WO 03/00329 


flow regulator for aerosol drag delivery device and methods 


1/3/2003 



TABLE 4: Forward and reverse primers for nucleic acid amplification of SARSV 



Pair Number 


Forward primer SEQ ID NO 


Forward Primer Start 


Forward Primer Stop 


Forward Primer Tm 


Forward Primer %GC 


Reverse primer SEQ ID NO 


Reverse Primer Start 


Reverse Primer Stop 


Reverse Primer Tm 


Reverse Primer %GC 


! Primer TmDiff 


Product Length 


Product Tm 


Product %GC 


Anneal Score | 


Optimum Anneal Temp 


1 


1021 


12726 


12746 


51.3 


47.6 


3521 


12996 


12977 


50.2 


40 


1 


271 


75 


42.8 


26 


52.6 


2 


1022 


12236 


12256 


51.2 


42.9 


3522 


12993 


12975 


51.4 


47.4 


0.2 


758 


76.4 


42.5 


26 


54 


3 


1023 


12373 


12391 


50.8 


47.4 


3523 


12993 


12975 


51.4 


47.4 


0.6 


'621 


76.4 


43 


26 


53.8 


4 


1024 


12236 


12256 


51.2 


42.9 


3524 


12996 


12977 


50.2 


40 


0.9 


761 


76.4 


42.3 


26 


53.6 


5 


1025 


12373 


12391 


50.8 


47.4 


3525 


12996 


12977 


50.2 


40 


0.5 


624 


76.4 


42.8 


26 


53.6 


6 


1026 


12726 


12746 


51.3 


47.6 


3526 


12993 


12975 


51.4 


47.4 


0.1 


268 


75.1 


43.3 


26 


53.1 


7 


1027 


2671 


2692 


52.1 


40.9 


3527 


3185 


3164 


51 


45.5 


1.2 


515 


75.6 


41.6 


26 


53.3 


8 


1028 


28942 


28961 


50.2 


45 


3528 


29298 


29280 


51.4 


52.6 


1.2 


,357 


76.4 


44.8 


26 


53.6 


9 


1029 


19801 


19819 


53.2 


52.6 


3529 


19922 


19901 


51.5 


45.5 


1.7 


,122 


72.2 


43.4 


26 


51.1 


10 


1030 


19800 


19817 


50.4 


50 


3530 


19921 


19901 


50.2 


47.6 


0.3 


,122 


72.2 


43.4 


26 


50.7 


11 


1031 


9930 


9948 


51.5 


52.6 


3531 


10605 


10588 


51.1 


50 


0.4 


676 


75.8 


41.3 


27 


53.5 


12 


1032 


9933 


9952 


50.9 


45 


3532 


10605 


10588 


51.1 


50 


0.2 


673 


75.8 


41.2 


27 


53.4 


13 


1033 


9930 


9949 


52.2 


50 


3533 


10605 


10588 


51.1 


50 


1.1 


676 


75.8 


41.3 


27 


53.5 
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28 
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41 
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_ 
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40 
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28 
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45 


1065 
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40 
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40 


1.5 


402 






__28 
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66 
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75.9 


42.3 


42 


53.7 


2440 


3460 


3089 


3110 


51.8 


45.5 


5960 


3188 


3166 


51.6 


43.5 


0.2 


100 


. 72 


46 


42 


51 


2441 


3461 


18979 


19000 


51.6 


45.5 


5961 


19215 


19194 


50.2 


40.9 


1.4 


237 


73.5 


40.1 


42 


51.6 


2442 


3462 


26421 


26441 


51.5 


42.9 


5982 


26900 


26882 


51.5 


52.6 


0.1 


480 


77.5 


46.2 


42 


54.8 


2443 


3463 


26421 


26441 


51.5 


42.9 


5963 


26828 


26810 


52.9 


52.6 


1.4 


408 


76.6 


44.9 


42 


54.2 


2444 


3464 


11540 


11557 


50.4 


50 


5964 


11826 


11802 


51.3 


40 


0.8 


287 


74.4 


: 41.1 


42 


52.3 


2445 


3465 


26421 


26441 


51.5 


42.9 


5965 


26695 


26678 


50.5 


50 


1 


275 


74.9 


42.5 


42 


52.7 


2446 


3466 


11540 


11557 


50.4 


50 


5966 


11819 


11798 


50.3 


40.9 


0.1 


280 


74.3 


41.1 


42 


52.2 


2447 


3467 


11540 


11557 


50.4 


50 


5967 


11817 


11797 


50.4 


42.9 


0.1 


278 


74.3 


41 


42 


52.2 


2448 


3468 


23841 


23859 


50.5 


52.6 


5968 


24515 


24494 


50.4 


40.9 


0.1 


675 


76.1 


41.9 


42 


53.5 


2449 


3469 


3055 


3077 


52.8 


43.5 


5969 


3495 


3473 


51.8 


43.5 


0.9 


^441 


76 


43.1 


42 


53.9 


2450 


3470 


3795 


3813 


52.1 


52.6 


5970 


4565 


4542 


53.9 


41.7 


1.8 


771 


75.6 


40.3 


42 


53.6 


2451 


3471 


11540 


11560 


53.2 


47.6 


5971 


11984 


11966 


53 


52.6 


0.2 


445 


75.1 


40.7 


42 


53.6 


2452 


3472 


11541 


11561 


50.9 


42.9 


5972 


12165 


12147 


51.2 


47.4 


0.4 


625 


75.7 


41.3 


42 


53.4 


2453 


3473 


3795 


3815 


54.6 


52.4 


5973 


4318 


4294 


54.4 


40 


0.2 


524 


75.5 


41.2 


42 


54.3 


2454 


3474 


7723 


7741 


52.2 


52.6 


5974 


7853 


7833 


50.7 


47.6 


1.5 


131 


71.3 


40.5 


42 


50.2 


2455 


3475 


3055 


3075 


51.8 


47.6 


5975 


3504 


3485 


50.4 


45 


1.4 


450 


76.1 


43.1 


42 


53.5 


2456 


3476 


3055 


3074 


51.1 


50 


5976 


3494 


3473 


50.4 


40.9 


0.7 


440 


76 


43 


42 


53.4 


2457 


3477 


26421 


26441 


51.5 


42.9 


5977 


26651 


26631 


50.2 


42.9 


1.3 


231 


73.8 


41.1 


42 


51.8 


2458 


3478 


28109 


28130 


50.2 


40.9 


5978 


28672 


28654 


50.6 


52.6 


0.4 


564 


79.9 


51.6 


42 


56.1 


2459 


3479 


3055 


3074 


51.1 


50 


5979 


3504 


3485 


50.4 


45 


0.7 


450 


76.1 


43.1 


42 


53.5 


2460 


3480 


12232 


12250 


51.9 


52.6 


5980 


12412 


12392 


50 


42.9 


1.9 


181 


73.2 


41.4 


42 


51.3 


2461 


3481 


3034 


3053 


50.3 


50 


5981 


3210 


3190 


50.5 


47.6 


0.2 


177 


74.9 


45.8 


42 


52.6 


2462 


3482 


3034 


3053 


50.3 


50 


5982 


3494 


3473 


50.4 


40.9 


0.1 


, 461 


76.1 


43.2 


42 


53.5 


2463 


3483 


3034 


3053 


50.3 


50 


5983 


3504 


3485 


50.4 


45 


0.1 


471 


76.2 


43.3 


42 


53,5 


2484 


3484 


12236 


12256 


51.2 


42.9 


5984 


12498 


12480 


50 


47.4 


1.1 


263 


74.6 


42.2 


42 


52.4 


2465 


3485 


12352 


12375 


52.9 


41.7 


5985 


12724 


12705 


52.4 


55 


0.5 


373 


75.7 


42.9 


42 


53.8 
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2466 


3486 


26421 


26441 


51 .£ 


42.S 


6986 


26585 


26567 


51 


47.4 


0.£ 


16£ 


72.2 


4C 


42 


I 50.9 


2467 


3487 


3031 


3051 


51 .C 


52.4 


5987 


3503 


3484 


51.5 


5C 


0.1 


47C 


76.C 


4?,f 


4? 


53.9 


2468 


3488 


18704 


18724 


50.£ 


47.6 


5988 


19480 


1945S 


50.3 


40.S 


0.4 


777 


75.5 


40.2 


42 


53.1 


2469 


3489 


3016 


3036 


50.2 


42.9 


5989 


3646 


3625 


52 


40.9 


1.£ 


631 


76.4 


42.e 


42 


53.6 


2470 


3490 


2823 


2844 


50.4 


45.5 


5990 


3053 


3034 


50.3 


50 


0.2 


231 


74 


41.6 


42 


52 


2471 


3491 


12366 


12384 


51.7 


52.6 


5991 


12994 


12976 


50.3 


47.4 


1.3 


629 


76.4 


42.9 


42 


53.7 


2472 


3492 


12366 


12384 


51.7 


52.6 


5992 


12992 


12974 


51.2 


52.6 


0.5 


627 


76.5 


43.1 


42 


54 


2473 


3493 


2823 


2844 


50.4 


45.5 


5993 


3056 


3037 


52.1 


55 


1.6 


234 


74.2 


41.9 


42 


52.2 


2474 


3494 


2522 


2541 


51.4 


45 


5994 


2672 


2654 


50,9 


52.6 


0.5 


151 


7,'^,3 


48.3 


42 


53 


2475 


3495 


2522 


2541 


51.4 


45 


5995 


2675 


2656 


50.4 


50 


1 


154 


75.2 


48.1 


_42 


52.9 


2476 


3496 


2429 


2447 


50.2 


47.4 


5996 


3056 


3037 


52.1 


55 


1.9 


6?R 


76.3 


42.7 


42 


53.6 


2477 


3497 


2429 


2447 


50.2 


47.4 


699/ 


3190 


3169 


50.7 


45.5 


0.5 


762 


76.6 


42.9 


42 


53.8 


2478 


3498 


27436 


27455 


52.7 


45 


5998 


27541 


27521 


51.7 


47.6 


1 


106 


72.1 


45.3 


42 


51.1 


2479 


3499 


2429 


2447 


50.2 


47.4 


5999 


3192 


3171 


51.9 


50 


1.7 


764 


76.7 


43.1 


42 


538 


2480 


3500 


2427 


2445 


52.1 


52.6 


6000 


3056 


3037 


62.1 


55 


0 


630 


76.4 


42.9 


42 


54.2 


2481 


3fa01 


27389 


27407 


50.6 


47.4 


6001 


27541 


27521 


51.7 


47.6 


1.1 


153 


73.2 


43.1 


42 


51.5 


2482 


3502 


2427 


2445 


52.1 


52.6 


6002 


3190 


3169 


50.7 


45.5 


1.4 


764 


76.7 


43.1 


42 


54 


2483 


3503 


18616 


18636 


51.4 


47.6 


6003 


19316 


19295 


50 


40.9 


1.4 


701 


75.4 


40.2 


42 


52.9 


2484 


3504 


2377 


2395 


52.4 


52.6 


6004 


2672 


2653 


51.6 


50 


08 


296 


77 


47.3 


42 


54.5 


2485 


3505 


18591 


18611 


51.7 


42.9 


6005 


19216 


19195 


50.2 


40.9 


1.4 


626 


75.7 


41.1 


42 


53.1 


2486 


3506 


12366 


12384 


51.7 


52.6 


6006 


12739 


12718 


51 


40.9 


0.7 


374 


75.6 


42.8 


42 


53.3 


2487 


3507 


2377 


2395 


52.4 


52.6 


6007 


2672 


2654 


50.9 


52.6 


1.5 


296 


77 


47.3 


42 


54.3 


2488 


3508 


16982 


17001 


51.2 


55 


6UU8 


17111 


17090 


51.1 


40.9 


0.1 


130 


74.6 


48.5 


42 


52.6 


2489 


3509 


2377 


2395 


52.4 


52.6 


6009 


2675 


2656 


50.4 


50 


2 


299 


77 


47.2 


42 


54.1 


2490 


3510 


18590 


18608 


50.6 


42.1 


6010 


19216 


19195 


50.2 


40.9 


0.3 


627 


75."6 


41 


42 


53.1 


2491 


3511 


2377 


2395 


52.4 


52.6 


6011 


2891 


2873 


50.8 


47.4 


1.6 


515 


76.8 


44.5 


42 


54.1 


2492 


3512 


8220 


8240 


54 


47.6 


6012 


8935 


8917 


54.5 


5?,fi 


0.4 


716 


75.4 


40.1 


42 


54.1 


2493 


3613 


12370 


12388 


50.1 


47.4 


6013 


12998 


12979 


50.1 


45 


0.1 


629 


76.4 


42.9 


42 


53.6 


2494 


3514 


2223 


2244 


51.4 


45.5 


6014 


2675 


2656 


50.4 


50 


1 


453 


77 


45.3 


42 


54.1 


2495 


3515 


2220 


2239 


51.3 


45 


6015 


2672 


2654 


50.9 


52.6 


0.4 


453 


77 


45.3 


42 


54.2 


2496 


3516 


24418 


24439 






3016 


24936 


24919 


51.8 


50 


1.1 


519 


76 


42.4 


42 


53.8 


2497 


3517 


18586 


18603 


50.4 


44.4 


6017 


19216 


19195 


50.2 


40.9 


0,2 


.631 


75.6 


40.9 


42 


53.1 


2498 


3518 


2220 


2239 


51.3 


45 


6018 


2675 


2656 


50.4 


50 


0.8 


456 


76.9 


45.2 


42 


54.1 


2499 


3519 


1402 


1422 


50.2 


42.9 


6019 


2153 


2134 


50.4 


45 


0.2 


752 


76.7 


43.1 


42 


53.8 


2500 


3520 


1356 


1375 


53.8 


55 


B020 


2153 


2133 


52.1 


42.9 


1.7 


798 


76.9 


43.5 


42 


54.5 



Table 5: Primers 



Forwa 
ID NO j 


rd primer SEQ 
St Co-ordinates 


Reverse primer SEQ 
iD NO & Co-ordinates 


T„(FOR&REV)(°C) 


Product 
length (bp) 


6076 


1-19 


6171 


199-183 


60.1 


50.3 


199 


6077 


149-169 


6172 


334-315 


51.5 


52.4 


186 


6078 


292-310 


6173 


560-541 


50.8 


51.1 


269 


6079 


598-619 


6174 


749-731 


52.6 


50.6 


152 


6080 


721-742 


6175 


930-912 


50.4 


50.3 


210 


6081 


888-912 


6176 


1077-1058 


52.8 


51.2 


190 


6082 


984-1003 


6177 


1149-1131 


51.1 


51.1 


166 
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6083 


1 1 57-1 1 75 


6178 


l*l-/y-l4DU 


50.9 


51.6 


323 


6084 


1420-1441 


6179 


1 700-1 680 


51 .2 


50.7 


281 


6085 


1 685-1 707 


6180 


1 834-1 81 1 


53.8 


53.7 


150 


6086 


1740-1764 


6181 


1 987-1 963 


53.4 


52.2 


248 


6087 


2007-2025 


6182 


2251-2232 


50.3 


50.1 


245 


6088 


2226-2245 


6183 


2385-2366 


50.4 


50.1 


160 


6089 


2428-2446 


6184 


2749-2728 


50.1 


50.3 


322 


6090 


2742-2763 


6185 


2893-2875 


50.6 


51.4 


152 


6091 


2823-2844 


61 86 


3082-3058 


50.4 


52.3 


260 


6092 


3007-3031 


61 87 


3185-3164 


51 .9 


51 .0 


179 


6093 


3234-3254 


6188 


3497-3478 


51.1 


51.3 


264 


6094 


3453-3476 


6189 


3647-3627 


51.8 


52.1 


195 


6095 


3601 -3622 


6190 


3877-3853 


52.5 


53.6 


277 


6096 


4007-4027 


6191 


4158-4135 


51.1 


51.4 


152 


6097 


4141-4165 


6192 


4316-4295 


51.3 


50.8 


176 


6098 


4366-4387 


6193 


4567-4544 


54.6 


55.4 


202 


6099 


• 4488-4508 


6194 


4708-4690 


50.7 


50.3 


221 • 


6100 


4658-4677 


6195 


4994-4974 


50.5 


51.2 


337 


6101 


4902-4922 


6196 


51 1 5-5092 


50.5 


51.4 


214 


6102 


5239-5260 


6197 


5450-5430 


50.8 


50.9 


212 


6103 


5366-5389 


61 98 


5560-5542 


50.5 


51 .8 


195 


6104 


5593-561 2 


61 99 


5860-5836 


50.8 


51.6 


268 


6105 


. 6042-6062 


6200 


6291 -6271 


50.4 


51.1 


250 


6106 


6271 -6291 


6201 


6483-6463 


51.1 


50.2 


213 


6107 


7017-7040 


6202 


7171-7153 


52.4 


52.8 


155 


6108 


7253-7272 


6203 


7504-7486 


50.3 


50.3 


252 


6109 


7415-7434 


6204 


7677-7654 


54.5 


5i3.6 


263 


6110 


761 5-7635 


6205 


7821-7798 


51.1 


52.8 


207 


6111 


7728-7746 


6206 


7936-7915 


51 .7 


50.1 


209 


6112 


7845-7867 


6207 


7994-7970 


52.7 


53.4 


150 


6113 


8011-8029 


6208 


8189-8170 


51 .4 


50.6 


179 


61 14 


Q-1 >io 0-1 etet 


6209 




52.2 


50.8 


158 


6115 


8221-8239 


6210 


8388-8369 


51.0 


51.1 


168 


6116 


8553-8575 


6211 


8931-8915 


51.8 


50.3 


379 


6117 


8867-8886 


6212 


9254-9236 


50.7 


50.6 


388 
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6118 


9244-9267 


6213 


9597-9573 


51.9 


53.4 


354 


6119 


9620-9640 


6214 


9990-9969 


51.3 


51.3 


371 


6120 


10009-10027 


6215 


10188-10171 


50.2 


50.2 


180 


6121 


10093-10113 


6216 


10244-10223 


52.4 


50.6 


152 


6122 


10242-10265 


6217 


10608-10589 


51.2 


51.0 


367 


6123 


10549-10571 


6218 


10783-10763 


53.7 


55.2 


235 


6124 


10766-10785 


6219 


10930-10912 


52.0 


51.1 


165 


6125 


11065-11085 


6220 


11305-11287 


50.7 


50.0 


241 


6126 


11265-11287 


6221 


11429-11405 


54.5 


53.5 


165 


6127 


11552-11571 


6222 


11730-11709 


52.0 


50.4 


179 


6128 


11705-11726 


6223 


11869-11848 


50.1 


50.2 


165 


6129 


11801-11824 


6224 


11984-11967 


51 .5 


50.4 


184 


6130 


12040-12058 


'■ 6225 


12254-12235 


52.3 


51 .9 


215 


6131 


12235-12253 


6226 


12406-12388 


50.1 


50.1 


172 


6132 


12366-12384 


' 6227 


12730-12712 


51.7 


52.2 


365 


6133 


12727-12748 


6228 


12994-12976 


50.8 


50.3 


268 


6134 


12948-12966 


6229 


13224-13201 


50.7 


51.7 


277 


6135 


. 13175-13196 


6230 


13324-13300 


54.3 


55.1 


150 


6136 


13237-13258 


6231 


13545-13526 


52.9 


52.9 


309 


6137 


13790-13810 


6232 


13963-13945 


50.9 


50.7 


174 


6138 


14080-14098 


6233 


14280-14257 


51.5 


51 .0 


201 


6139 


14405-14427 


6234 


14561-14540 


50.2 


50.9 


1 57 


,6140 


14882-14906 


6235 


15046-15024 


50.9 


51.5 


1 65 


6141 


14951-14976 


6236 


15145-15124 


53.1 


52,9 


195 


6142 


15113-15134 


6237 


15275-15257 


51.6 


50.8 


163 


6143 


15211-15230 


6238 


15383-15363 


50.2 


50.1 


173 


6144 


15364-15387 


6239 


15528-15506 


54.0 


52.1 


165 


6145 


15456-15477 


6240 


15605-15585 


52.0 


53.2 


150 


6146 


15513-15532 


6241 


15897-15876 


51.2 


50.4 


385 


6147 


15837-15856 


6242 


15999-15978 


52.3 


50.8 


163 


6148 


16073-16096 


6243 


16301-16277 


51.7 


52.8 


229 


6149 


16245-16266 


6244 


16404-16380 


50.3 


52.0 


160 


6150 


16366-16385 


6245 


16515-16492 


52.9 


53.8 


150 


6151 


16553-16571 


6246 


16777-16758 


53.4 


51.5 


225 


6152 


16832-16852 


6247 


17026-17004 


51.0 


51.6 


195 
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6153 


1RQR?-17n01 


6248 


1735Q-1 7*^40 

i / \3iJ^ 1 % OtU 


51 2 


50.2 


378 


6154 


1 y^RA-l 7*^79 

1 r OvJ*r^ 1 / O / ^ 


6249 


17m i-i7AQn 

1 f O 1 11/ ^S7l/ 


51 3 


50.4 


158 


6155 


1 7422-1 7443 


6250 


I/O / O 1 / s^\J£L. 


50.2 


51 .1 


152 


6156 




6251 


177fiQ- 1774ft 

1 / / DS7 III HO 


50 7 


51.5 


167 


6157 


17728-17746 


6252 


17ftft9-17RRP 


50.9 


51 .2 


156 


6158 


1 iDKJ III 0\JO\J 


6253 


1 fti ft'^-1 R1 Ar\ 

\ O 1 DO- 1 O 1 fU 


52 9 


51.9 


153 


6159 


1 OU 1 tJ- 1 OUoO 


6254 


1 ft99t5-i ft^nf^ 


54 4 


55.0 


150- 


6160 


1 ft97n-1 


6255 




51 .9 


51 .4 


163 


6161 


1 ooo^-1 OO / O 


6256 


Iod4o-i OOidy 


51 .3 


50.8 


297 


61 62 


1 oOtJU-loo/l 


6257 


18702-18684 


50.4 


51.9 


153 


6163 




6258 


1 9004-1 8983 


50,6 


51 .0 


285 


61 64 


1 tsabu-ioyo 1 


6259 


19109-19085 


54.7 


54.3 


150 


• 6165 


1 yuDo-1 »uc>9 


6260 


19217-19195 


52.8 


51 .7 


1 53 


6166 






iy4/D-iy4o4 


50.2- 


52.1 


1 67 


6167 


1 yooy-i 90o9 




ly/i y-1 y/u 1 


50.5 


51.8 


151 


6168 


19707-19731 


6263 


19856-19833 


55.7^ 


55.9 


. 150 


6169 


19771-19792 


6264 


19921-19901 


50.1 


50.2 


151 


6170 


19833-19851 


6265 


19986-19966 


50.9 


50.7 


154 


able 6: Primers 


Forward primer SEQ 
ID NO & Co-ordinates 


Reverse primer SEQ 
ID NO & Co-ordinates 


Tm (FOR & REV) ("C) 


Product 
length (bp) 


6266 


20110-20132 


6305 


20425-20404 


51.9 


50.9 


316 


6267 


20468-20492 


6306 


20617-20596 


53.2 


53.5 


150 


6268 


20557-20578 


6307 


20891-20871 


50.4 


50.6 


335 


6269 


20838-20856 


6308 


21037-21015 


52.5 


52.0 


200 


6270 


21 096-21 1 1 6 


6309 


21295-21272 


50.1 


51.7 


200 


6271 


22173-22194 


6310 


22414-22395 


52.4 


51.0 


242 


6272 


22320-22342 


6311 


22501-22479 


54.8 


54.3 


182 


6273 


22532-22552 


6312 


22695-22675 


50.6 


50.0 


164 


6274 


22712-22736 


6313 


22873-22852 


56.7 


55.5 


162 


6275 


22842-22861 


6314 


23086-23067 


51.0 


52.8 


245 


6276 


23151-23170 


6315 


23395-23376 


51 .4 


50.3 


245 


6277 


23307-23326 


6316 


23524-23501 


51.1 


51.1 


218 


6278 


23615-23635 


6317 


23776-23758 


50.7 


50.2 


162 
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6279 


23838-23857 


6318 


P'^QQR-P'^Q77 


50 4 


50.6 


159 


6280 


24030-24051 


6319 


P44.n7-P4'^RR 


57 6 


55.7 


378 


6281 


24388-24407 


6320 




50.4 


50.1 


194 


6282 




6321 




52.0 


50.4 


380 


6283 




6322 


1 04-^0 1 Ob 


50.1 


51.2 


263 


6284 




6323 


PR4.nn ORQQo 


51 .1 


51 .4 


200 


6285 




6324 




51.1 


50.5 


284 - 


6286 


25656-25681 


6325 




54.5 


56.4 


184 


6287 


25761 -25782 


6326 




54.6 


54.3 


222 


6288 


26039-26058 


6327 


<£D1 oy-i£D 1 DO 


54.0 


53.0 


151 


6289 


2B1 ft4-PRPn*^ 

1 OH- ^ vj^vjo 


6328 




50.9 


51.8 


150 


6290 


o«/ioo-o«/i/io 


6329 




51 .3. 


50.2 


239 


6291 


26571-26589 


6330 


PR7'5Q K 


51 .7 


53.2 


169 


6292 






<ibyou-<iby4i 


51.1 


52.2 


228 


6293 


PR RRR-PRRR 




OT^ on 0~7H H "7 

<£/ icjy-ii/i 1 7 


50.7 


51.9 


274 


6294 


P7^nn-P7^P1 

^ # OuLI— / Oi^ 1 




d. 1 f 4oy 


51.2 


50.2 


159 


6295 


27361 -27380 


6334 


P7'^7Q 07KKa 

^loi y-i£ / ooo 


52.4 


51.1 


219 


6296 


27718-27740 


' 6335 


<t/yi /-/i/yui 


50.7 


50.0 


200 


6297 


28041 -28059 


6336 


^ij*iu/-^oi cjy 


50.8 


50.8 


167 


6298 


281 66-281 89 


6337 


OOAi 1 ODOQO 

<io'H 1 -^tjoyo 


52.2 


52.9 


246 


6299 


28395-28414 


6338 




51 .5. 


50.2 


277 


6300 


28654-28672 


6339 




50.6 


52.3 


168 


6301 


28867-28885 


6340 


29184-29166 


51 .5 


51.6 


318 ' 


6302 


29183-29204 


6341 


29360-29342 


50.4 


50.4 


178 


6303 


29262-29279 


6342 


29626-29606 


50.1 


50.2 


365 


6304 


29538-29559 


6343 


29690-29670 


50.0 


50.4 


153 



Table 7: Primers 



Name 


SEQ ID NO: 


Co-ordinates 




Name 


SEQ ID NO: 


Co-ordinates 


AB4f 


6344 


19869-19888 




CB1r 


6367 


28011-28030 


AB5f 


6345 


20238-20257 




CB2r 


6368 


27671-27690 


BCIf 


6346 


20581-20600 




CB3r 


6369 


27301-27320 


BC2f 


6347 


20950-20969 




CB4r 


6370 


26931-26950 


BC3f 


6348 


21339-21358 




CB5r 


6371 


26575-26594 


BC4f 


6349 


21708-21727 




CB6r 


6372 


26191-26210 


BC5f 


6350 


22041-22060 




CB7r 


6373 


25841-25860 


BC6f 


6351 


22410-22429 




CB8r 


6374 


25476-25495 


BC7f 


6352 


22759-22778 




CB9r 


6375 


25126-25145 
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BC8f 


6353 


23131-23150 




CBIOr 


6376 


24791-24810 


BC9f 


6354 


23500-23519 




CB11r 


6377 


24422-24441 


BC10f 


6355 


23841-23860 




CB12r 


6378 


24031-24050 


BC11f 


6356 


24210-24229 




CBISr 


6379 


23673-23692 


BC12f 


6357 


24560-24579 




CB14r 


6380 


23298-23317 


BC13f 


6358 


24941-24960 




CB15r 


6381 


22928-22947 


BC14f 


6359 


25310-25329 




CB16r 


6382 


22567-22586 


BC15f 


6360 


25675-25694 




CB17r 


6383 


22196-22215 


BC16f 


6361 


26044-26063 




CB18r 


6384 


21831-21850 


BC17f 


6362 


26413-26432 




CB19r 


6385 


21431-21450 


BC18f 


6363 


26763-26782 




CB20r 


6386 


21073-21092 


BC19f 


6364 


27132-27151 




CB21r 


6387 


20715-20734 


BC20f 


6365 


27491-27510 




BAIr 


6388 


20345-20364 


BC21f 


6366 


27845-27864 




BA2r 


6389 


19969-19988 










BASr 


6390 


19599-19618 










BA4r 


6391 


19228-19247 










BASr 


6392 


18852-18871 



Table 8: Primers 



4ame 


SEQ ID NO 


Co-ordinates 


Name 


SEQ ID NO 


Co-ordinates 


F1 


6393 


1-19 


R1 


6441 


334-315 


F2 


6394 


292-310 


R2 


6442 


749-731 


F3 


6395 


721-742 


R3 ' 


6443 


1077-1058 


F4 


6396 


984r1003 


R4 


6444 


1479-1460 


F5 


6397 


1420-1441 


R5 


6445 


1834-1811 


F6 


6398 


1740-1764 


R6 


6446 


2251-2232 


F7 


6399 


2226-2245 


R7 , 


6447 


2749-2728 


F8 


6400 


2742-2763 


.. R8 


6448 


3082-3058 


F9 


6401 


3007-3031 


R9 


6449 


3497-3478 


F10 


6402 


3453-3476 


RIO 


6450 


3877-3853 


F11 


6403 


4.007-4027 


R11 


6451 


4316-4295 


F12 


6404 


4366-4387 


R12 


6452 


4708-4690 


F13 


6405 


4658-4677 


R13 


6453 


5115-5092 


F14 


6406 


5239-5260 


R14 


6454 


5560-5542 


F15 


6407 


5593-5612 


R15 


6455 


6291-6271 


F16 


6408 


6271-6291 


R16 


6456 


7171-7153 


F17 


6409 


7253-7272 


R17 


6457 


7677-7654 


F18 


6410 


7615-7635 


R18 


6458 


7936-7915 


F19 


6411 


7845-7867 


R19 


6459 


8189-8170 


F20 


6412 


8143-8166 


R20 


6460 


8388-8369 


F21 


6413 


8553-8575 


R21 


6461 


9254-9236 


F22 


6414 


9244-9267 


R22 


6462 


9990-9969 


F23 


6415 


10009-10027 


R23 


6463 


10244-10223 


F24 


6416 


10242-10265 


R24 


6464 


10783-10763 


F25 


6417 


10766-10785 


R25 


6465 


11305-11287 


F26 


6418 


11265-11287 


R26 


6466 


11730-11709 


F27 


6419 


11705-11726 


R27 


6467 


11984-11967 


F28 


6420 


12040-12058 


R28 


6468 


12406-12388 


F29 


6421 


12366-12384 


R29 


6469 


12994-12976 


F30 


6422 


12948-12966 


R30 


6470 


13324-13300 


F31 


6423 


13237-13258 


R31 


6471 


13963-13945 
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F32 


6424 


14080-1 409£ 


F33 


6425 


14882-14906 


F34 


6426 


15113-151 3A 


F35 


6427 


1 5364-1 5387 


F36 


6428 


15513-15532 


F37 


6429 


16073-16096 


F38 


6430 


16366-16385 


F39 


6431 


1 6832-1 6852 


F40 


6432 


17354-17372 


F41 


6433 


17603-17623 


F42 


6434 


1 801 1 -1 8030 


F43 


6435 


18270-18292 


F44 


6436 


18550-18571 


F45 


6437 


18960-18981 


F46 


6438 


19310-19329 


F47 


6439 


19707-19731 


F48 


6440 


19833-19851 



Table 9: Primers 





Name 




1 


CB12R 


6488 


2 


R001 0 


6489 


3 


R0011 


6490 


4 


R0012 


6491 


5 


BNI-ED 


6492 


6 


BNI-EU 


6493 


7 


SAR1S-U 


6494 


8 


SARIAs-D 


6495 


9 


SAR1S 


6496 


10 


SARI As 


6497 


11 


IN2-U 


6498 


12 


IN4-D 


6499 


13 


IN-2 


6500 


14 


IN-4 


6501 


15 


IN-6 


6502 


16 


IN-7 


6503 


17 


COR1-U 


6504 


18 


COR2-D 


6505 


19 


COR-1 


6506 


20 


COR-2 


6507 


21 


HKUF-U 


6508 


22 


HKUR-D 


6509 


23 


HKU-F 


6510 


24 


HKU-R 


6511 


25 


1451-D 


6512 


26 


1451-U 


6513 


27 


690-D 


6514 


28 


690-U 


6515 


29 


690-D2 


6516 



R32 


6472 


14561-14540 


R33 


6473 


lO I40-10124 


R34 


6474 


1 OoHo-l 5oDo 


R35 


6475 


1 ODUiD- 1 OOoO 


R36 


6476 


1 oyyy-1 oy7o 


R37 


6477 


1 6404-1 6380 


R38 


6478 


1 b777-1 6758 


R39 


6479 


1 / ooy-1 7340 


R40 


6480 


t /o/o-i food 


R41 


6481 


H "7000 H "TOOn 


R42 


6482 




R43 


6483 


18648-18629 


R44 


6484 


19004-18983 


R45 


6485 


19217-19195 


R46 


6486 


19719-19701 


R47 


6487 


19921-19901 



— 


Name 


SEQ ID NO:' 


37 


EMC8-D2 


6524 


38 


EMC8-U2 


6525 ! 


-~— 




6526 


40 


rihAr^A -4 11 
bMCI 1-U 


6527 


41 


ORFI B-D 


6528 




vJHrl D-U 


6529 


43 


ORFS-D 


6530 


44 


ORFS-U 


6531 


45 


E7-717F 


6532 


46 


E8-85R 


6533 


47 


E8-307F 


6534 


48 


E11-771F 


6535 


49 


E11-96R 


6536 


50 


CON1-F 


6537 


51 


CON1-U 


6538 


52 


CON2-F 


6539 


53 


CON2-R 


6540 


54 


CON3-F 


6541 


55 


CON3-R 


6542 


56 


15-F 


6543 


57 


15-R 


6544 


58 


15-F2 


6545 


59 


15-R2 


6546 


60 


13-F 


6547 


61 


13-R 


6548 


62 


13-F2 


6549 


63 


13-R2 


6550 


64 


CONTIG-F 


6551 


65 


QT3-R 


6552 
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30 


690-U2 


6517 


31 


EMC7-D 


6518 


32 


EMC7-U 


6519 


33 


EMC7-D2 


6520 


34 


EMC7-U2 


6521 


35 


EMC8-D 


6522 


36 


EMC8-U 


6523 



66 


QT3-F 


6553 


67 


QIN-R 


, 6554 


68 


QIN-F 


6555 


69 


AB1-F 


6556 


70 


AB2-F 


6557 


71 


AB3-F 


6558 


72 


AB1-R 


6559 



BNSDOCID; <WO__200409236aA2_l_> 
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SARS ORF 
(SEQ ID NO) 


Length 
(aa) 


Role 


Cleavage 
site 


Features 


Cons" 




P28 (9766) 


179 


Leader protein 


179 (G/G)* 




+ 




P65 (9767) 


639 


Homologue of IWHV p65 
cleavage product 


818 (G/A) 








Nspl 
(9768) 


2422** 


Papain like protease, 
cleaves the first two 
proteins 


3240 (Q/S) 


phosphoesterase domain 
Zn binding domain 


+ 


ORFIa 


Nsp2 
(9769) 


306 


3C-fike protease, cleaves 
proteins nsp1-nsp12 


3546 (Q/G) 




+ 




iNopo \p/ fXJ) 




? 


3836 (Q/S) 


5 TMDs 


+ 




Nsp4 (9771) 


83 




391 9 (Q/A) 


1 TMD 


+ 




Nsp5 (9772) 


198 


? 


4117 (QftJ) 




+ 




Nsp6 (9773) 


113 


? 


4230 (Q/A) 




+ 




Nsp7 (9774) 


139 


? 


4369 (Q/S) 


Putative growth factor-like motif 


+ 




Nsp9 (9775) 


932 


RNA polymerase 


5298 (Q/A) 




+ 


r> 

LL 


Nsp1 0(9776) 


601 


Putative helicase 

Tanner el at (2003) J Bbl Cham 278:39578-82 


5899 (Q/A) 


Metal binding domain. 
ATP/GTP binding domain 


+ 


QC 

o 


Nsp1 1 (9777) 


527 


? 


6426 (Q/S) 




+ • ■ 




Nspl 2 (9778) 


346 


? 


6772 (Q/A) 




+ 




Nsp13 (97V9) 


298 








+ 




Spike (S) 
(6042) 




Major antigenic 
determinant, contains the 
receptor-binding domain 




Leader peptide. 1 TMD. 17 N- 
glycosylation sites 


+ 




OrfS (6043) 


274 


? 




2 TMDs. 1 N-glycosylation site, 10 
O-Qlycosylation sites 






Orf4 (6044) 


154 


? 






- 


c 


Envelope (E) 
(6045) 


76 


Associated with viral 
envelope 




1 TMD, 2 N-glycosylation sites , 




jral regie 


Matrix {M) 
(6046) 


221 


Associated with viral 
envelope, membrane 
spanning protein 




3 TMDs, 1 N-glycosylation site 




<o 


Orf7 
(6047) 


63 






1 TMD 






OrfS (6048) 


122 


? 




1 TMD 


- 




Orf9 (6049) 


44 


7 




Surface-associated 






OrflO 


39 


? 




Surface-associated 






Orfl 1(6050) 


84 


? 




1 N-glycosylationsite 






Nucleocapsid (N) 
(6052) 


422 


Associated with viiat genomic RNA 




phosphoprotein 


+ 




Orfl 3 


98 


? 









Cons'' *: + indicates presence of corresponding protein at least in one of the other coronaviruses 
#: Alternatively, cleaved after Gly-Gly {le. at G/A) to give a ISOmer 
5 ##: This 2422mer may be further cleaved after residue 1922 (Gly-2740 of SEQ ID NO- 6039) to 
give a 1922mer PLpro containing the Zn-binding motif (SEQ ID NO: 7254) and a SOOmer 
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Table 11: Protein homologies between SARS and other coronaviruses 

Numbers indicate percentage of aminoacid identity between SARS proteins and corresponding 
gene products of other coronaviruses. More conserved pairs are in bold; more variable pairs are 
underlined. 





group 1 


group 2 


group 3 


Proteins 


22QB 


TGV 


PEDV 


MHV 


BCoV 


AIBV 


REPLICASE REGION 


leader protein p28 


<20 


<20 


<:20 


27 


<20 


^20 


p65 homotogue 


<20 


23 


23 


<20 


20 


<20 


nsp1 (PLP protease) 


25.5 


25.8 


25.4 


29 


30 


25 


nsp2 (3CL protease) 


40.4 


43.8 


44.6 


50 


48.4 


41 


nsp3' 


30 


2Z 


29.4 


34.2 


35.5 


28.5 


nsp4 


38.6 


42.2 


39.8. 


47.5 


46.1 


37.3 


nsp5 


48.2 


42.9 


43.9 


46.8 


47.3 


38.7 


nsp6 


45.1 


36.9 


45.1 


45.1 


46.9 


39.8 


nsp7 


53.8 


54.5 


56.1 


56.2 


55.4 


58.3 


nsp9 (polymerase) 


59.8 


59.6 


60 


67.3 


66.9 


62.4 


nsp10 (helicase) 


60.7 


62 


62.3 


67.2 


68.6 


58.9 


nspl 1 


52.3 


53.7 


52.3 


57.6 


57.6 


52 


nsp12 


43.1 


43 


45.4 


45.9 


45 


40.2 


nsp13 


56.4 


54.4 


55,3 


63 


65 




STRUCTURAL REGION 


Spike (S) 


28.8 


31.6* 


30.3 


31.1 


31 


32.7* 


Envelope (E) 


33* 


27.9 


20 


23 


26.5 


23.2 


Matrix glycoprotein (M) 


30.6 


32.5 


34.8 


40.8 


41.9 


32.5 


Nucleocapsid (N) 


26.9 


30.1 


29.5 


37.3 


37.4 


31.5 



* These three alignments were obtained only on a fragment of the whole protein. 
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Table 12: Nucleotide and aminoacid differences between five SARS isolates 







FRA* 


TOR2* 


Urbani* 


CUHK* 


HKU* 




position" 


base/aminoacid 


base/aminoacid 


base/aminoacid 


base/aminoacid 


base/aminoacid 




""iis — 



A/Thr 


G/Ala 


G/Ala 


G/Ala 


G/Ala 






3VVal 








C 




^^^^ 


G/Pro 






T 






7919 


C/Ala 




T/Val 






ORFIa 


7930 


G/Asp 








A/Asn _ . 




8387 


G/Ser 












8416 


G/Arg 








^^r 




9404 


T/Val 






C/Ala 






9479 


T/Val 






C/Ala 






11448 


T/lle 


c 


C 


c 


Q 




13494 


GT/Val 








AG/Ser 




16622 


C/Ala 




T 








17564 


T/Asp 






C/Glu 




ORFIb 


17846 


C/Arg 






T 






18065 


G/Lys 








A ~ 




18965 


A/lle 


T 


T 


T 


-]- 




19064 


A/Glu 




G 


G 






19084 


T/lle 


C/Thr 


C/Thr 


C/TTir 


C/Thr 




21721 


G/Gly 






A/Asp 






22222 


T/lle 






C/Thr 




spike 


23220 


T/Ser 


G/Ala 
















C 








24933 


T/Phe 


C/Leu 




C/Leu 


C/Leu 


ORF3 


25298 


G/Gly 


A/Arg 










25569 


T/Met 








A/Lys 


matrix 


26600 


T/Val 


C/Ala 


C/Ala 


C/Ala 






26857 


T/Ser 




C/Pro 






ORF10 


27827 


T/Cys 






C/Arg 




nucleocapsid 


28268 1 T/lle 


C/Thr 


C/Thr 


C/Thr 


C/Thr 



* SARS coronavirus FEIA (accession number AY3 10120) 
SARS coronaviras T0R2 (accession number AY2741 19) 
SARS coronavirus Urbani (accession number AY278741) 
SARS coronavirus CUHK-Wl (accession number AY278554) 
SARS coronavirus HKU-39849 (accession number AY278491) 



° The position is based on the FRA sequence. 
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TABLES 13-25: T-epitope predictions for SEQ ID NOS: 6039-6050 & 6052 

Epitope predictions were peifonned at http:/Avww.rnpiW-berlin.mpg.deMAPPP/binding.httnl using a 
miniTTmrn scoie of 0.5 and the BIMAS matrix, with a maximum of 20 results being selected- The 
analysis revealed 9mer and lOmer epitopes. 

Table 13: Epitopes for SEQ ID NO: 6039 



HLA Al - 9 mers 



iMaximum possible score using tliis molecule type 


5625 1 




Start position 


Sequence 


% of max. score 


ScoreJ 


1 


1867 


SEQ ID NO: 7400 


8 % 


450 1 


2 


4139 


SEQ ID NO: 7401 


5.55 % 


312.5 J 


3 


88 


SEQ ID NO: 7402 


4 % 


225 1 


4 


4249 


SEQ ID NO: 7403 


3.55 % 


200 1 


5 


4059 


SEQ ID NO: 7404 


2.22 % 


125 1 


6 


2027 


SEQ ID NO: 7405 


1.6 % 


90 1 


7 


3413 


SEQ ID NO: 7406 


1.11 % 


62.5 1 


8 


1823 


SEQ ID NO: 7407 


0.88 % 


50 1 


9 


2798 


SEQ ID NO: 7408 


0.88 % 


50 1 


10 


220 


SEQ ID NO: 7409 


0.8 % 


45 


11 


3738 


SEQ ID NO: 7410 


0.8 % 


45 


12 


4182 


SEQ ID NO: 7411 


0.8 % 


45 


13 


4174 


SEQ ID NO: 7412 


0.66 % 


1 37.5 


14 


1940 


SEQ ID NQ: 7413 


0.55 % 


1 31.25 


15 


38 


SEQ ID NO: 7414 


0.48 % 


1 27 


16 


1231 


SEQ ID NO: 7415 


0.44 % 


1 25 


17 


1613 


SEQ ID NO: 7416 


0.44 % 


1 25 


18 


3645 


SEQ ID NO: 7417 


0.44 % 


1 25 


19 


4192 


SEQ ID NO: 7418 


0.44 % 


1 25 


20 


378 


SEQ ID NO: 7419 


0.4 % 


1 22.5 








HLA Al - 10 mers 




l^laximum possible score using this molecule type 


1 5625 


Rank 


Start position 


1 Sequence 


1 % of max. score 




1 


1867 


1 SEQ ID NO: 7420 


1 8 % 


450 


2 


1495 


j SEQ ID NO: 7421 


1 4 % 


225 


3 


3921 


1 SEQ ID NO: 7422 


11 2.4 % 


135 


4 


486 


1 SEQ ID NO: 7423 


II 2.22 % 


125 


5 


4139 


1 SEQ ID NO: 7424 


1 2.22 % 


125 
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6 


1 62 


SEQ ID NO: 7425 


1 1.6 % 


90 


7 


1 1190 


SEQ ID NO: 7426 


1.6% 


90 


8 


1 1284 


SEQ ID NO: 7427 


1.6 % 


90 


9 


1 3284 


SEQ ID NO: 7428 


1.6 % 


90 


10 


1 2921 


SEQ ID NO: 7429 


1.2 % 1 


67.5 


11 


1 349 


SEQ ID NO: 7430 


0.8 % 


45 


— 12 


1 789 


SEQ ID NO: 7431 


0.8 % 


45 


13 


1 1185 


SEQ ID NO: 7432 


0.8 % 


45 


14 


1 4184 


SEQ ID NO: 7433 


0.8 % 


45 


15 


1313 


SEQ ID NO: 7434 


0.64 % 


36 


16 


3948 


SEQ ID NO: 7435 


0.48 % 1 


27 






bcQ ID NO: 7436 


0.44 % 1 


25 


18 


941 


SEQ ID NO: 7437 


0.44 % 1 


25 


19 


1390 


SEQ ID NO: 7438 


0.44 % 


25 


20 


1613 


SEQ ID NO: 7439 


0.44 % 1 


25 



H LA A3 - 9 mers 



Maximum possible score using thiis molecule type 


12150 


Rank 


Start position 


1 


Sequence 


1 9/6 of max. score 


Score 


1 


1010 


4 


SEQ ID NO: 7440 


1 1.48% 


180 


2 


3155 




SEQ ID NO: 7441 


1 1.48 % 


180 


3 


1229 


1 


SEQ ID NO: 7442 


1 1.23 % 


150 


4 


2405 




SEQ ID NO: 7443 


\ 0.88 % 


108 


5 I 


2 


1 


SEQ ID NO: 7444 


1 0.74 % 


90 


6 1 


2304 


1 


SEQ ID NO: 7445 


1 0.74 % 


90 


7 1 


2358 




SEQ ID NO: 7446 


i 0.74 % ~ 


90 


8 1 


3160 


1 


SEQ ID NO: 7447 


0.74 % 


90 


.9 1 


3771 




SEQ ID NO: 7448 


1 0.74 % 


90 


10 1 


4007 




SEQ ID NO: 7449 


0.74 % 


90 


11 1 


3079 


II 


SEQ ID NO: 7450 


0.66 % 


81 


12 1 


4045 




SEQ ID NO: 7451 


0.66 % 


81 


13 1 


1081 


II 


SEQ ID NO: 7452 


0.49 % 


60 


14 1 


3268 


1 


SEQ ID NO: 7453 


0.49 % 


60 


15 1 


4144 


+ 


SEQ ID NO: 7454 


0.49 % 


60 


16 1 


614 




SEQ ID NO: 7455 


0.37 % 


45 


17 1 


728 


1 


SEQ ID NO: 7456 


0.37% 1 


45 


18 1 


1537 




SEQ ID NO: 7457 


0.37 % 


45 


19 1 


313 


1 


SEQ ID NO: 7458 


0.32 % 


40 


20 I 


1744 


1 


SEQ ID NO: 7459 


0.32 % 


40 
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HLA A3 - 10 mers 


Maximum possible score using tills molecule type 


12150 


Rank 


Start position 


Sequence 


% of max. score 


Score 




62 


SEQ ID NO: 7460 


4.44 % 


540 


2 1 


2151 


SEQ ID NO: 7461 


2.46 % 


300 


3 


633 


SEQ ID NO: 7462 


2.22 % 


270 


4 


1158 


SEQ ID NO: 7463 


2.22 % 


-270 


5 


2565 


SEQ ID NO: 7464 


2.22 % 


270 


6 


2298 


SEQ ID NO: 7465 


1.77 % 


216 


7 


3159 


SEQ ID NO: 7466 


1.11 % 


135 


8 


640 


SEQ ID NO: 7467 


0.98 % 


120 


9 


I 2186 


SEQ ID NO: 7468 


0.74 % 


90 


10 


i 3869 


SEQ ID NO: 7469 


0.74 % 


90 


MM 


1 2308 


SEQ ID NO: 7470 


0.66% 


81 


ri2n 


r 786 


SEQ ID NO: 7471 


0.55 % 


67.5 


13 


I 749 


SEQ ID NO: 7472 


0.49 % 


60 


14 


I ' 1080 


SEQ ID NO: 7473 


0.49 % 


1 60 


15 


1 2358 


SEQ ID NO: 7474 


0.49 % 


1 60 


16 


1 3955 


SEQ ID NO: 7475 


0.49 % II 60 


17 


1 714 


SEQ ID NO: 7476 


0.37 % II 45 


18 


1 1081 


SEQ ID NO: 7477 


0.37 % II 45 


19 


! 1170 


SEQ ID NO: 7478 


0.37 % ; II 45 


20 


1 1228 


SEQ ID NO: 7479 


0.37% II 45 



HLAA24-9mers 


Maximum possible score using tliis molecule type 


1596.672 


Rank j 


Start position 


Sequence 


9/o of max. score 


Score 


1 i 


3797 


SEQ ID NO: 7480 


37.57 % 1 


600 


2 


4202 


SEQ ID NO: 7481 


37.57 % 1 


600 


3 


3189 


SEQ ID NO: 7482 


25.05 % 1 


400 


4 ! 


1864 


SEQ ID NO: 7483 


23.14 % 1 


369.6 


5 ! 


1066 


SEQ ID NO: 7484 


22.54 % I 


360 


6 


2143 


SEQ ID NO: 7485 


22.54 % i 


360 


7 


2693 


SEQ ID NO: 7486 


22.54 % 1 


360 


8 


1426 


SEQ ID NO: 7487 


18.78% 


300 


9 


1238 


SEQ ID NO: 7488 


18.03 % 


288 


10 


3768 


SEQ ID NO: 7489 


18.03% 


288 


11 


797 


SEQ ID NO: 7490 


15.03% II 240 1 
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12 
13 


1 1882 
1490 


bEQ ID NO: 7491 
SEQ ID NO: 7492 


15.03 % 
13.77 % 


II 
II 


240 
220 


14 


1 2237 


btL2 ID NO: 7493 


13.77 % 




220 


15 


95 


ottj ID NU: 7494 


12.52 % 




200 


16 


1821 


btQ ID NO: 7495 


12.52 % 




200 


17 


2289 


SEQ ID NO: 7496 


12.52 % 


1 


200 


18 


3080 


1 SEP ID NO: 7497 


12.52 % 


■ Ji 


200 


19 


3660 1 


SEQ ID NO: 7498 1 


12.52 % 




200 


20 1 


4354 1 


SEQ ID NO: 7499 


12.52 % 




200 




1 HLA A 0201 - 9 mers 


IMaximum possible score usina this molecule tvne 


3925227.1 


iRiankll Start Dosition 


Sequence II o/„ of max. score 


Score 


1 T 1 2308 
1 2 1 3729 


bEQ ID NO: 752011 0.20 % 
SEQ ID NO: 7521 II 0.10 % 


8144.13515256 
4047.23088 
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3 


3574 


SEQ ID NO: 7522 


0.09 % II 3547.4996634 


4 


3615 


SEQ ID NO: 7523 


0.06 % II 2722.682592 


5 


3159 


SEQ ID NO: 7524 


0.05 % II 1999.734264 


6 


2339 


SEQ ID NO: 7525 


0.03 % 


1 1551.92907744 


7 


2201 


SEQ ID NO: 7526 


0.03 % 


II 1521.53694 


8 


3559 


SEQ ID NO: 7527 


0.02 % 


II 1174.38939504 


9 


3085 


SEQ ID NO: 7528 


0.02 % 


II 1146.296448 


10 


4070 


SEQ ID NO: 7529 


0.02 % 


II 970.4103696 


11 


3708 


SEQ ID NO: 7530 


0.02 % 


11 958.92888 


12 


3098 


SEQ ID NO: 7531 


0.02 % 


II 942.678 


13 


1362 


SEQ ID NO: 7532 


0.02 % 


II 900.6984 


14 


3563 i 


SEQ ID NO: 7533 


0.01 % 


II 735.86016 


15 


3774 1 


SEQ ID NO: 7534 


0.01 % 


11 687.655656 


16 


4242 1 


SEQ ID NO: 7535 


1 0.01 % 


II 685.78272 . 


17 


2340 ! 




SEQ ID NO: 7536 


1 0.01 % 


II 668.37342936 


18 


650 1 


SEQ ID NO: 7537 


0.01 % 


II 640.1983392 


19 


3862 1 


SEQ ID NO: 7538 


0.01 % 


620.57772 


20 


2860 1 


SEQ ID NO: 7539 


0.01 % II 607.88448. 




HLA A 0201 - 10 mers 


Maximum possible score using this molecule type 


3925227.1 


Rankll Start position! Sequence 


°/o of max. score 


Score 


1 II 2307 II SEQ ID NO: 7540 


0.40% 


15915.66281448 


2 II 2201 II SEQ ID NO: 7541 


0.12 % 


4772.09313 


3 H 3558 II SEQ ID NO: 7542 


0.05 % 


2295.04855632 


4 1! 1772 II SEO ID NO: 7543 


0.04 % 


1759.6656 


5 1 


3087 II SEQ ID NO: 7544 


0.03 % 


1215.76896 


6 1 


2339 II SEQ ID NO: 7545 


0.02 % 


1116.29986272 


7 1 


2308 II SEO ID NO: 7546 


0.02 % 


970.14776112 


8 1 


3061 II SEO ID NO: 7547 


0.02 % 


836.2525104 


9 1 


2748 II SEQ ID NO: 7548 


0.01 % 


726.706344 


10 1 


3837 II SEQ ID NO: 7549 


0.01 % 


720.8292 


11 1 


59 II SEO ID NO: 7550 


0.01 % 


650.3112 


12 


2877 II SEO ID NO: 7551 


0.01 % 


620.22996 | 


13 


4114 II SEO ID NO: 7552 


0.01 % 


559.8936 | 


14 


805 II SEQ ID NO: 7553 


0.01 % 


484.4565072 


15 


1655 11 SEQ ID NO: 7554 


0.01 % 


437.48208 I 


16 


611 II SEO ID NO: 7555 


0.00 % 


319.9392 


17 


1961 II SEO ID NO: 7556 


0.00 % 


305.94186 
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18 


1223 


SEQ ID NO: 7557 


0.00 % II 289.08792 


19 


852 


SEQ ID NO: 7558 


0.00 % II 285.67242 


20 


2139 


SEQ ID NO: 7559 


0.00 % II 284.845869 







HLA A 1101 - 9 mers 




IMaximum possible score using this molecule type 


36 




1 Start position || Sequence 


II 9/6 of max. score 


Score 


1 


4200 


II SEQ ID NO: 7560 


II 50 % 


18 


2 


281 


II SEQ ID NO: 7561 


1 25 % 


9 


3 


3236 


II SEQ ID NO: 7562 


1 25 % 


9 


4 


509 


11 SEQ ID NO: 7563 II 16.66% 


6 


5 


848 


II SEQ ID NO: 7564 


1 16.66 % 


6 


6 


2193 


II SEQ ID NO: 7565 


1 16.66 % 


6 


7 


3542 


II SEQ ID NO: 7566 


1 16.66 % 


6 


8 


541 


II SEQ ID NO: 7567 


1 15 % 


5.4 


9 


1748 


1 SEQ ID NO: 7568 


1 12.5 % 


4.5 


10 


829 


II SEQ ID NO: 7569 


1 11.11 % 


4 


11 


1149 


1 SEQ ID NO: 7570 


1 11.11 % 


4 


12 


2027 


1 SEQ ID NO: 7571 


1 11.11 % 


4 


13 


2576 


1 SEQ ID NO: 7572 


1 11.11% 


4 


14 


873 


1 SEQ ID NO: 7573 


1 8.33% 


3 


15 


2725 


1 SEQ ID NO: 7574 


1 8.33 % 


3 


16 


3541 , 


1 SEQ ID NO: 7575 


1 8.33 % 


3 


17 


. 1837 


1 SEQ ID NO: 7576 


1 7.5 % 


2.7 


18 


2475 


1 SEQ ID NO: 7577 


1 7.5 % 


2.7 


19 


2703 


1 SEQ ID NO: 7578 


7.5% 


2.7 


20 


1823 


1 SEQ ID NO: 7579 


6.66 % 


2.4 






HLA A 1101 - 10 mers 11 


Maximu 


m possible score ui 


5ing this molecule type 


^6 1 


Rank 


Start position 


Sequence 


9/6 of max. score 




1 


3541 


SEQ ID NO: 7580 I 


50 % 


18 


m 


281 


SEQ ID NO: 7581 1 


25 % 


9 




1495 


SEQ ID NO: 7582 I 


25 % 


9 


4 1 


2303 


SEQ ID NO: 7583 I 


25 % 


9 


5 1 


2616 


SEQ ID NO: 7584 I 


25 % 


9 


6 1 


48 


SEQ ID NO: 7585 1 


16.66 % 


6 


7 1 


1394 


SEQ ID NO: 7586 1 


16.66 % 


6 


8 1 


1499 


SEQ ID NO: 7587 1 


16.66 % 1 


6 
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9 


1862 


SEO ID NO: 7588 


16.66 % 


6 


10 


1163 


SEO ID NO: 7589 


11.11 % 


4 


11 


4006 


SEQ ID NO: 7590 


11.11 % 


4 


12 


4344 


SEQ ID NO: 7591 


11.11 % 


4 


13 


633 


SEQ ID NO: 7592 


10 % 


3.6 


14 


119 


SEQ ID NO: 7593 


8.33 % 


3 


15 


1190 


SEQ ID NO: 7594 


8.33 % 


3 


16 


1195 


SEQ ID NO: 7595 


8.33 % 


- 3 


17 


1725 


SEQ ID NO: 7596 


8.33 % 


3 


18 


2728 


SEQ ID NO: 7597 


8.33 % 


3 


19 


2895 


SEQ ID NO: 7598 


8.33 % 


3 


20 


3033 


SEQ ID NO: 7599 


8.33 % 


3 




[1 HLA B7 - 9 mers 


iMaximum possible score usinq this molecule type 


5400 


irssm 


Start position 


Seguence 


°/o of max. score 


Score 


1 


1335 


SEQ ID NO: 7600 


4.44 % 


240 


2 


2580 


SEQ ID NO: 7601 


4.44 % 


240 


3 


1703 


SEQ ID NO: 7602 


3.70 % 


200 


4 


113 


SEQ ID NO: 7603 


2.22 % 


120 


5 


168 


SEQ ID NO: 7604 


2.22 % 


120 


6 


2842 


SEQ ID NO: 7605 


2.22 % 


120 


7 


4027 . 


SEQ ID NO: 7606 


2.22 % 


120 


8 


3680 


SEQ ID NO: 7607 


1.66 % 


90 


9 


2085 


SEQ ID NO: 7608 


1.48 % 


80 


10 


2492 


SEQ ID NO: 7609 


1.48 % 


80 


11 


2660 


SEQ ID NO: 7610 


1.48 % 


80 


12 


2906 


SEQ ID NO: 7611 


1.48 % 


80 


13 


3346 


SEQ ID NO: 7612 


1.48 % 


80 


14 


4038 


SEQ ID NO: 7613 


1.48 % 


80 


15 


1163 


SEQ ID NO: 7614 


1.11 % 


60 


16 


1457 


SEQ ID NO: 7615 


1 1.11 % 


60 


17 


2351 


SEQ ID NO: 7616 


1.11 % 


60 


18 


2471 


SEQ ID NO: 7617 


1.11 % 


60 


19 


3499 


SEQ ID NO: 7618 


1.11 % 


60 


20 


3635 


SEQ ID NO: 7619 


1.11 % 


60 



HLA B7 - 10 mers 

iMaxrmum possible score using this molecule type Hi 5400 
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Rank 


1 Start position 


Sequence 


°/o of max. score 


(1 Score 


1 


II 1703 


SEQ ID NO: 7620 


3.70 % 


II 200 


2 


1 

1 17 


SEQ ID NO: 7621 


2.22 % 


II 120 


3 


1 3008 


SEQ ID NO: 7622 


2.22 % 


II 120 


4 


1 4106 


SEQ ID NO: 7623 


2.22 % 


II 120 


5 


1 3450 


SEQ ID NO: 7624 


1.66 % 


II 90 


6 


1 113 


SEQ ID NO: 7625 


1.48 % 


1 80 


7 


C 195 


SEQ ID NO: 7626 


1.48 % 


1 - 80 


8 


1 307 


SEO ID NO: 7627 


1.48 % 


1 80 


9 


1 780 


SEQ ID NO: 7628 


1.48 % 


1 80 


10 


1 1000 


SEQ ID NO: 7629 


1.48 % 


1 80 


11 


1 1072 


SEQ ID NO: 7630 


1.48 % 


1 80 


12 


I 1404 


SEQ ID NO: 7631 


1.48 % 


1 80 


13 


1 1980 


SEQ ID NO: 7632 


1.48 % 


1 80 


14 


1 2262 


SEQ ID NO: 7633 


1.48 % 


1 80 


t« 


1 2543 


SEQ ID NO: 7634 


1.48% 


1 80 


16 


1 2906 


SEQ ID NO: 7635 


1.48 % 


1 80 


17 


1 3077 


SEQ ID NO: 7636 


1.48 % 


1 80 


18 


1 3175 


SEQ ID NO: 7637 


1.48 % 


1. 80 _ 


19 


1 4195. 


SEQ ID NO: 7638 


1.48 % 


1 80 


20 


1 4251 


SEQ ID NO: 7639 


1.48 % 


1 80 


Table 14: Epitopes for SEQ ID NO: 6040 


HLA Al - 9 mers 


Maximu 


m posisible score using tliis molecule type 


5625 


Rank 


Start position 


Sequence 


°/o of max. score 


Score 1 


1 


20 


SEQ ID NO: 7640 


0.04 % 


2.25 1 


2 


91 


SEQ ID NO: 7641 


0.01 % 


i 1 


3 


125 


SEQ ID NO: 7642 


0.01 % 


0.75 


4 


56 


SEQ ID NO: 7643 


0.00 % 


0.5 


5 


145 


SEQ ID NO: 7644 


0.00 % 


0.5 








HLA Al - 10 mers 


Maximu 


m possible score using this molecule type 


5625 


Rank 


Start position 


Sequence 


9/6 of max. score 1 


Score 1 


1 


20 


SEQ ID NO: 7645 


0.01 % 1 


0.9 1 


2 


56 


SEQ ID NO: 7646 


0.00 % 




3 


71 


SEQ ID NO: 7647 


0.00 % 


0.5 1 
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L ^ 


1 144 


1 SEQ ID NO: 7648 


1 0.00 % 


1 0.5 II 




HLA A3 - 9 mers 


Maximum possible score using this molecule type 


12150 


Rank 


1 Start position 


Sequence 


^/o of max. score 


Score 


1 


1 115 


1 SEQ ID NO: 7649 


0.24 % 


30 


2 


1 87 


SEQ ID NO: 7650 


0.04% 


6 


3 


! 80 


SEQ ID NO: 7651 


0.03 % 


4.05 1 


4 


1 125 


SEQ ID NO: 7652 


0.01 % 


1.8 1 




39 


SEQ ID NO: 7653 


0.01 % 


1.5 1 


6 


56 


SEQ ID NO: 7654 


0.01 % 


1.5 1 


7 


135 


SEQ ID NO: 7655 


0.00 % 


1.2 1 


8 


91 


SEQ ID NO: 7656 


0.00 % 


1 1 


9 


119 1 


SEQ ID NO: 7657 


0.00 % 


1 ) 


10 


141 1 


SEQ ID NO: 7658 


0.00 % 


0.9 1 


11 


150 1 


SEQ ID NO: 7659 


0.00% 


0.6 1 


1 12 


137 1 


SEQ ID NO: 7660 


0.00 % 


0.54 



HLA A3 - 10 mers 


Maximum possible score usrnq this molecule type 


12150 


Ranic 


Start position 


1 Sequence 


°/o of max. score 


Score 


1 


36 


1 SEQ ID NO: 7661 


0.24 % 


30 


2 


144 


1 SEQ ID NO: 7662 


0.06 % 


8 


3 


101 


1 SEQ ID NO: 7663 


0.03 % 


4 


4 


99 


1 SEQ ID NO: 7664 


0.02 % 


3.6 


5 


80 


1 SEQ ID NO: 7665 


0.02 % 


2.7 


6 


125 


1 SEQ ID NO: 7666 


0.01 % 


1.6875 


. 7 


71 


1 SEQ ID NO: 7667 


0.01 % 


1.5 


8 


118 


1 SEQ ID NO: 7668 


0.01 % 


1.5 


9 


40 


1 SEQ ID NO: 7669 


0.01 % 


1.35 


10 


5 


1 SEQ ID NO: 7670 


0.00 % 


0.9 


11 


56 


SEQ ID NO: 7671 


0.00 % 


0.9 


12 


107 


SEQ ID NO: 7672 


0.00 % 


0.6 


13 


135 


SEQ ID NO: 7673 


0.00 % 


0.6 


14 


141 


SEQ ID NO: 7674 


0.00 % 


0.6 


15 


148 


SEQ ID NO: 7675 


0.00 % 


0.6 


16 


116 


SEQ ID NO: 7676 


0.00 % 


0.5 







|i HLA A24 - 9 mere 
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Maximum possible score usinq tills molecule type 


1596.672 


Rank 


Start position 


Sequence || o/o of max. score 


Score 


1 


153 


SEO ID NO: 7677 H 


1.05 % 


16.8 


2 


80 


SEQ ID NO: 7678 || 


0.75 % 


12 


3 


123 


SEQ ID NO: 7679 II 


0.50 % 


8 


4 


137 


SEQ ID NO: 7680 || 


0.50 % 


8 


5 


9 


LSEQ ID NO: 7681 || 


0.45 % 


7.2 


6 


77 


SEQ ID NO: 7682 II 


0.45 % 


7.2 


7 


112 


SEQ ID NO: 7683 II 


0.45 % 


7.2 


8 


73 


SEQ ID NO: 7684 || 


0.41 % 


6.6 


9 


32 


SEQ ID NO: 7685 II 


0.37 % 


6 


10 


110 


SEQ ID NO: 7686 || 


0.37 % 


6 


11 


140 


SEQ ID NO: 7687 || 


0.37 % 


6 


12 


143 


SEQ ID NO: 7688 || 


0.37 % 


6 


13 


18 


SEQ ID NO: 7689 || 


0.30 % 


4.8 


14 


54 


SEQ ID NO: 7690 || 


0.30 % 


4.8 


15 


108 


SEQ ID NO: 7691 || 


0.30 % 


4.8 


16 


141 


SEQ ID NO: 7692 11 


0.30 % 


4.8 


17 


92 


SEQ ID NO: 7693 || 


0.27 % 


4.4 


18 


33 


SEQ ID NO: 7694 || 


0.25 % 


4 


19 


49 


SEQ ID NO: 7695 II 


0.25 % 


4 


20 


111 


SEQ ID NO: 7696 || 


0.25 % 


4 






1 HLA A24 - 10 mers 


iMaximum possible score usinq this molecule type 


1596.672 


i Rank || Start position 


Sequence || o/o of max. score 


Score 


1 1 


142 


SEQ ID NO: 7697 II 


12.52 % 


200 


1 2 


110 


SEQ ID NO: 7698 || 


0.75 % 


12 


1 3 


99 


SEQ ID NO: 7699 II 


0.50 % 


8 


1 4 ! 


8_ 


SEQ ID NO: 7700 II 


0.45 % 


7.2 


1 5 j 


140 


SEQ ID NO: 7701 || 


0.45 % 


7.2 


1 6 II 32 


SEQ ID NO: 7702 || 


0.37 % 


6 


1.7 II 17 


SEQ ID NO: 7703 || 


0.30 % 


4.8 


1 8 II 53 


SEQ ID NO: 7704 || 


0.30 % 


4.8 


1 9 1 


76 


SEQ ID NO: 7705 II 


0.30 % 


4.8 


1 10 1 


107 


SEQ ID NO: 7706 || 


0.30 % 


4.8 




111 


SEQ ID NO: 7707 || 


0.30 % 


4.8 




72 


SEQ ID NO: 7708 || 


0.27 % 


4.4 




91 


SEQ ID NO: 7709 II 


0.27 % 


4.4 



-407- 



wo 2004/092360 



PCT/US2004/01I710 



1 14 


31 


SEQ ID NO: 7710 


0.25 % 


4 


1 15 


127 


SEQ ID NO: 7711 


0.25 % 


4 


1 16 


139 


SEQ ID NO: 7712 


0.25 % 


4 


1 17 


80 


SEQ ID NO: 7713 


0.22 % 


3.6 


1 18 


38 


SEQ ID NO: 7714 


0.18 % 


3 


1 19 


118 


SEQ ID NO: 7715 


0.18 % 


3 


20 


49 


SEQ ID^NO: 7716 


0.12 % 


2 




HLA A 0201 - 9 mers 


Maximum possible score using this molecule type 


1 3925227.1 | 


Rank 


Start position 


Sequence 


°/o Of max. score 


Score 


1 


80 


SEQ ID NO: 7717 


0.00 % 


1 171.96732 


2 


147 


SEQ ID NO: 7718 


0.00 % 


1 51.46848 


3 


143 


SEQ ID NO: 7719 


0.00 % 


1 11.6146182 


4 


56 


SEQ ID NO: 7720 


0.00 % 


H 


L 1.304684 


5 


10 


SEQ ID NO: 7721 


0.00 % 


10.34586 


6 


6 


SEQ ID NO: 7722 


0.00 % 


1 6.56830734 


7 


26 


SEQ ID NO: 7723 


0.00 % 


1 6.07614 


8 


141 


SEQ ID NO: 7724 


0.00 % 


! 5.981472 


9 


148 


SEQ ID NO: 7725 


0.00 % 


! 5.194044 


10 


9 


SEQ ID NO: 7726 


0.00 % 


! 4.299183 


11 


137 


SEQ ID NO: 7727 


0.00 % 


1 4.299183 


12 


130 


SEQ ID NO: 7728 


0.00 % 


4.138344 


13 


84 


SEQ ID NO: 7729 


0.00 % 


3.42792 


14 


27 


SEQ ID NO: 7730 


0.00 % 


3.383484 


15 


2 


SEQ ID NO: 7731 


0.00 % 


3.381 


16 


62 


SEQ ID NO: 7732 


0.00 % 


3.251556 1 


17 


23 


SEQ ID NO: 7733 


0.00 % 


2.9542005 


18 


99 


SEQ ID NO: 7734 


0.00 % 


1.982232 


19 


33 


SEQ ID NO: 7735 


0.00 % 


1.86921 


20 


111 


SEQ ID NO: 7736 


0.00 % 


1.76402985 



HLA A 0201 - 10 mers 


Maximum possible score using this molecule type 


3925227.1 


Rank 


Start position 


Sequence 


9/6 of max. score 


Score 


1 


5 


SEQ ID NO: 7737 


0.00 % 


159.9696 


2 


25 


SEQ ID NO: 7738 


0.00 % 


69.552 


3 


80 


SEQ ID NO: 7739 


0.00 % 


36.5148 


4 


107 


SEQ ID NO: 7740 


0.00 % 


21.3624 



-408- 



BNSOOOID: <W O g 0O4O923e0A2 I > 



wo 2004/092360 



PCT/US2004/011710 



5 II 148 


SEQ ID NO: 7741 


0.00 % 


17.73576 


6 II 61 


SEQ ID NO: 7742 


0.00 % 


13.9104 


7 II 147 


SEQ ID NO: 7743 


0.00 % 


11.304684 


8 


1 53 


SEQ ID NO: 7744 


0.00 % 


8.230458 


9 


1 17 


SEQ ID NO: 7745 


0.00 % 


7.3086111 


10 


1 110 


SEQ ID NO: 7746 


1 0.00 % 


6.174104475 


11 


r 9 


SEQ ID NO: 7747 


0.00 % 


6.0858 


12 


i 99 


SEQ ID NO: 7748 


0.00 % 


5.6823984 


13 


! ' ■ 


SEQ ID NO: 7749 


0.00 % 


3.188283 


14 


1 41 


SEQ ID NO: 7750 


0.00 % 


, 2.206413 


15 


1 135 


SEQ ID NO: 7751 


0.00 % 


2.076624 


16 


76 


SEQ ID NO: 7752 


0.00 % 


2.005692 


17 


23 


SEQ ID NO: 7753 


0.00 % 


1.798209 


18 


40 


SEQ ID NO: 7754 


0.00 % 


1.68996456 


19 


39 


SEQ ID NO: 7755 


0.00 % 


1.516482 


20 


118 


SEQ ID NO: 7756 


0.00 % 


1.2683304 ^ 



HLA A 1101 - 9 mers 


Maximu 


m possible score using this molecule type 


36 


Rank 


Start position 


Sequence 


9/6 of max. score 


Score 


1 


91 


SEQ ID NO: 7757 


2.77 % 


1 



HLA A 1101 - 10 mers 


Maximum possible score using this molecule type 


36 


Rank 


Start position 


Sequence 


°/o of max. score 


Score 


1 


101 


SEQ ID NO: 7758 


33.33 % 


12 


2 


71 


SEQ ID NO: 7759 


2.77 % 


1 


3 


90 


SEQ ID NO: 7760 


1.66 % 


0.6 



1 HLA B7 - 9 mers 


iMaximum possible score using this molecule type 


t 5400 


Rank 


Start position 


Sequence 


% of max. score 


Score 


1 


49 


SEQ ID NO: 7761 


2.22 % 


120 


2 


9 


SEQ ID NO: 7762 


1.11 % 


60 


3 


73 


SEQ ID NO: 7763 


0.66 % 


36 


4 


33 


SEQ ID NO: 7764 


0.37 % 


20 


5 


137 


SEQ ID NO: 7765 


0.37 % 


20 


6 


141 


SEQ ID NO: 7766 


0.37 % 


20 


7 


77 


SEQ ID NO: 7767 


0.22 % 


12 
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g 1 


112 1 


SEQ ID NO: 7768 


0.22 % 


12 


9 


143 1 


SEQ ID NO: 7769 


0.22 % 


12 


10 


81 


SEO ID NO: 7770 


0.14 % 


8 


11 


13 


SEQ ID NO: 7771 


0.09 % 


5 


12 


69 


SEQ ID NO: 7772 


0.09 % 


5 


13 


18 ' 1 


SEQ ID NO: 7773 


0.07 % 


4 


14 


32 1 


SEQ ID NO: 7774 


0.07 % 


4 


15 


54 j 


SEQ ID NO: 7775 


0.07 % 


-4 


16 


80 1 


SEQ ID NO: 7776 


0.07 % 


4 


17 


92 i 


SEQ ID NO: 7777 


0.07 % 


4 


18 


108 1 


SEQ ID NO: 7778 


0.07 % 


4 


19 


111 1 


SEQ ID NO: 7779 


0.07 % 


' 4 


20 


123 1 


SEQ ID NO: 7780 


1 0.07 % 


4 






1 HLA B7 - 10 mers 


Maximum possible score using tliis molecule type 1 


5400 


Rank 


Start position 


Sequence 


1 o/o of max. score 1 


Score 




1 99 II SEQ ID NO: 7781 


1 0.74 % 


40 


KH 


r 17 


1 SEQ ID NO: 7782 


1 0:37 % 


20 


3 


1 8 


1 SEQ ID NO: 7783 


1 0.22 % 


12 


4 


72 


1 SEQ ID NO: 7784 


1 0.22 % 




5 


1 91 


1 SEQ ID NO: 7785 


1 0.22 % 


m 


6 


1 127 


1 SEQ ID NO: 7786 


1 0.11 % 




7 


1 31 


SEQ ID NO: 7787 


1 0.07 % 


1 4 


8 


1 32 


1 SEQ ID NO: 7788 


1 0.07 % 


1' 4 


9 


1 53 


1 SEQ ID NO: 7789 


1 0.07% 


4 


10 


1 76 


1 SEQ ID NO: 7790 


1 0.07 % 


4 


11 


1 107 


1 SEQ ID NO: 7791 


1 0.07 % 


4 


12 


1 Ho 


1 SEQ ID NO: 7792 


1 0.07 % 


4 


13 


1 111 


1 SEQ ID NO: 7793 


1 0.07 % 


4 


14 


1 140 


1 SEQ ID NO: 7794 


1 0.07 % 


4 


15 


1 9 


1 SEQ ID NO: 7795 


1! 0.05 % 


3 


16 


1 19 


1 SEO ID NO: 7796 


II 0.05 % 


3 


17 


1 33 


1 SEQ ID NO: 7797 


11 0.03 % 


2 


18 


1 93 


1 SEQ ID NO: 7798 


II 0.03 % 


2 


19 


102 


SEQ ID NO: 7799 


ii 0.03 % 


2 


20 


129 


SEQ ID NO: 7800 


11 0.02 % 


1.5 1 
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BMSKJCID: <WO__200408238aA2J_> 



wo 2004/092360 PCT/US2004/011710 



Table 15: Epitopes for SEQ ID NO: 6041 



HLA Al - 9 mers I 


Maximum possible score using tlifs molecule type 


5625 I 


Rank 


1 Start position || Seauence 


1 °/o of max. score 


Score 1 




1 1818 


!l SEO ID NO: 7801 


1 1.6 % 


90 1 


2 


L 373 


II SEO ID NO: 7802 


1 1.33 % 


75 1 


3 


[ 681 


11 SEQ ID NO: 7803 


1 1.33 % 




4 


1 74 


II SEQ ID NO: 7804 


1 0.88 % 


50 


5 


786 


II SEQ ID NO: 7805 


1 0.88 % 


50 


6 


1495 


II SEO ID NO: 7806 


1 0.88 % 


50 


7 


88 


ll SEQ ID NO: 7807 


1 0.8 % 


45 


1 8 1 


357 


II SEQ ID NO: 7808 


0.8 % 


45 




L_ 1271 


II SEO ID NO: 7809 


0.8 % 


45 




1799 


II SEO ID NO: 7810 


0.8 % 


45 


11 


1393 


II SEO ID NO: 7811 


0.48 % 


27 


12 


386 


II SEO ID NO: 7812 


0.44 % 


25 


13 


2304 


II SEO ID NO: 7813 


0.44 % 


25 


14 


198 


ILSEQ ID NO: 7814 ! 


0.4 % 


22.5 


15 1 


840 


II SEO ID NO: 7815 ! 


0.4 % 


22.5 


16 1 


2359 


II SEO ID NO: 7816 1 


0.4 % 


22.5 


17 1 


1194 


II SEO ID NO: 7817 1 


0.32 % 


18 


18 1 


1546 


II SEO ID NO: 7818 1 


0.32 % 


18 


19 1 


2200 


H SEO ID NO: 7819 1 


0.22 % 


12.5 


20 1 


996 


II SEO ID NO: 7820 1 


0.2 % 1 


11.25 



1 HLA Al - 10 mers 


iMaximL 


m possible score using tills molecule type 


I 5625 


1 Ranic 


Start position 


Sequence 


°/o of max. score 


1 Score 


m 


995 


SEQ ID NO: 7821 


10 % 


1 562.5 




1303 


SEQ ID NO: 7822 


2.22 % 


1 125 




1582 


SEQ ID NO: 7823 


2 % 


LA12_-5 


1 4 


1456 


SEQ ID NO: 7824 


1.6% 


90 


1 = 


772 


SEQ ID NO: 7825 


1.11 % 1 


62.5 


1 6 


181 


SEQ ID NO: 7826 


0.88 % 


50 


1 7 


632 


SEO ID NO: 7827 


0.88 % 


50 


! " 


2281 


SEQ ID NO: 7828 


0.88 % 


50 


L 9 


1586 


SEQ ID NO: 7829 


0.8 % 


45 




2109 


SEQ ID NO: 7830 


0.8 % 


45 


1 11 


745 


SEQ ID NO: 7831 


0.55. % 


31.25 
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12 


1 ± jxD 


II bEQ ID NO: 7832 


0.53 % 


30 




\ yoo 


II SEQ ID NO: 7833 


0.44 % 


25 


14 


1 "SQ"? 

1 iJJo/ 


II SEQ ID NO: 7834 


0.44 % 


25 


1 c: 




11 SEO ID NO: 7835 


0.44 % 


25 


16 




II SEQ ID NO: 7836 


0.26 % 


15 


17 1 


1057 


II SEQ ID NO: 7837 


0.22 % 






2562 


II SEQ ID NO: 7838 


0.22 % 


12.5 


NH 


74 


II SEQ ID NO: 7839 


0.17 % 1 


10 




298 


II SEQ ID NO: 7840 


0.17% 


10 



HLA A3 - 9 mers 


Maximum possible score using tinis molecule type 


12150 


Rank 


1 Start position 


1 Sequence 


% of max. score 


Score 


1 


1 536 


1 SEQ ID NO: 7841 


3.33 % . 


405 


2 


1 986 


1 SEQ ID NO: 7842 


2.46 % ; 


300 


3 


1 805 


1 SEQ ID NO: 7843 


1.64 % 


200 


4 


1 2345 


1 SEQ ID NO: 7844 


1.48 % 


180 1 


cr 
D 


1 2481 


1 SEQ ID NO: 7845 


0.55 % 


67.5 


6 


L_ 204 


1 SEQ ID NO: 7846 


0.49 % 


. 60 


7 


1 895 


1 SEQ ID NO: 7847 


0.44 % 


54 


8 


1 1512 


1 SEQ ID NO: 7848 


0.44 % 


■ 54 


9 


i 2491 


1 SEQ ID NO: 7849 


! 0.37 % 


45 


10 


436 


SEQ ID NO: 7850 


0.32 % 


40 


11 


917 


SEQ ID NO: 7851 


0.32 % 


40 1 


12 1 


1176 


SEQ ID NO: 7852 


0.32 % i 


40 


13 1 


1517 1 


SEQ ID NO: 7853 


0.29 % 1 


36 


= 14 1 


466 1 


SEQ ID NO: 7854 


0.24 % 1 


30 


15 1 


1784 1 


SEQ ID NO: 7855 


0.24 % 


30 


16 1 


2039 1 


SEQ ID NO: 7856 


0.24 % 


30 


17 1 


2124 1 


SEQ ID NO: 7857 


0.24 % 


30 


18 1 


1049 1 


SEQ ID NO: 7858 


0.22 % 


27 


19 


2200 i 


SEQ ID NO: 7859 


0.22 % 


27 


20 1 


2598 i 


SEQ ID NO: 7860 


0.22 % 


27 



HLA A3 - 10 mers 


Maximu 


m possible score using tliis molecule type 


12150 


Rank j 


Start position || Sequence 


% of max. score 


Score 




392 II SEO ID NO: 7861 
2230 II SEQ ID NO: 7862 


2.46, % 
1.48 % 


300 
180 
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3 


590 


SEQ ID NO: 7863 


1.11% II 135 


4 


697 


SEQ ID NO: 7864 


1.11 % ll 135 1 


5 


919 


SEQ ID NO: 7865 


0.74 % II 90 


6 


1354 


SEQ ID NO: 7866 


0.74 % II 90 


7 


1430 


SEQ ID NO: 7867 


0.74 % II 90 


8 


2534 


SEQ ID NO: 7868 


0.74 % H 90 


9 


202 


SEQ ID NO: 7869 


0.49 % II 60 


10 


488 


SEQ ID NO: 7870 


0.49 % H - 60 


11 


922 


SEQ ID NO: 7871 


0.49 % II 60 1 


12 


1735 


SEQ ID NO: 7872 


0.49 % II 60 1 


13 


2281 


SEQ ID NO: 7873 II 0.49 % II 60 | 


14 


1894 


SEQ ID NO: 7874 || 0.44 % II S4 


15 


2552 


SEQ ID NO: 7875 II 0.44 % 


1. 54 1 


16 


555 


SEQ ID NO: 7876 || 0.37 % 


i ■ 45 1 


17 1 


1134 


SEQ ID NO: 7877 || 0.37 % ' 


1' 45 1 


18 i 


1149 


SEQ ID NO: 7878 |j 0.29 % 


' 36 1 


19 


283 


SEQ ID NO: 7879 II 0.24 % = 


30 1 


20 


917 


SEQ ID NO: 7880 || 0.24 % 


30 1 



HLA A24 - 9 mers 


Maximum possible score 


using this molecule tvoe 


1 1596.672 


Rank 


Start position 


Sequence 


1 % of max. score 


1 Score 




1 2375 


SEQ ID NO: 7881 


1 36.07 % 


1 576 


2 


1751 


SEQ ID NO: 7882 


1 28.93 % 


1 462 




195 


SEQ ID NO: 7883 


1 25.05 % 


1 400 


4 


2306 


SEQ ID NO: 7884 


1 21.04 % 


1 336 


5 


806 


SEQ ID NO: 7885 


1 20.66 % 


1 330 


6 


1252 


SEQ ID NO: 7886 


1 18.78 % 


! 300 


7 


160 


SEQ ID NO: 7887 


1 15.03 % 


240 


8 1 


517 


SEQ ID NO: 7888 


15.03 % 


240 


9 1 


375 


SEQ ID NO: 7889 


12.52 % 


200 


10 1 


1275 


SEQ ID NO: 7890 


12.52 % 


200 


11 1 


2175 


SEQ ID NO: 7891 


12.52 % 1 


200 


12 


2207 


SEQ ID NO: 7892 


12.52 % 1 


200 


13 1 


2343 


SEQ ID NO: 7893 1 


12.52 % 1 


200 


14 1 


1 443 


SEQ ID NO: 7894 | 


11.27 % 1 


180 


15 1 


668 


SEQ ID NO: 7895 1 


7.51 % 1 


120 


NH 


1825 


SEQ ID NO: 7896 1 


6.88 % II 110 1 




1690 


SEQ ID NO: 7897 1 


4.69% II 75 T 
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18 


159 


SEO ID NO: 7898 


3.75 % 


60 


19 


2550 


SEQ ID NO: 7899 


3.75 % 


60 


20 


1949 


SEQ ID NO: 7900 


3.38 % 


54 1 






HLA A24 - 10 mers 


Maximum possible score using tills molecule type 


1596.672 


Rank || Start position 


Sequence 


1 9/6 of max. score 


Score 


1 II 641 


i SEQ ID NO: 7901 


1 45.09 % 


720 


2 II 809 II SEQ ID NO: 7902 


I 24.80 % 


396 


3 II 1209 II SEO ID NO: 7903 


1 22.54 % 


360 


4 


! 216 II SEQ ID NO: 7904 


1 18.03 % 


288 


5 


I 159 II SEQ ID NO: 7905 


15.03 % 


240 


6 


528 il SEQ ID NO: 7906 


15.03 % 


240 


7 


799 11 SEQ ID NO: 7907 


15.03 % 


240 


8 


1436 II SEQ ID NO: 7908 


15.03 % 


240 


9 


2219 1 SEQ ID NO: 7909 


15.03 % 


240 


10 II 1065 II SEO ID NO: 7910 


13.77 % 


220 


11 II 1953 II SEQ ID NO: 7911 


13.15 % 


210 


12 


1966 il SEQ ID NO: 7912 


12.52 % 


200 


13 


2600 II SEQ ID NO: 7913 


12.52 % 


200 


14 


71 II SEQ ID NO: 7914 


9.39 % 


150 


15 


380 II SEQ ID NO: 7915 


9.39 % 


150 


16 


1989 II SEQ ID NO: 7916 


9.39 % 


150 


17 


342 II SEQ ID NO: 7917 


8.76 % 


140 


18 


1071 II SEQ ID NO: 7918 


8.76 % 


140 


19 


2570 II SEQ ID NO: 7919 


6.88 % 


110 


20 


2550 11 SEQ ID NO: 7920 


6.26 % 


100 



HLA A 0201 - 9 mers 


Maximum possible score using this molecule type 


3925227.1 


Ranic 


Start position 


Sequence 


9/6 of max. score 


Score 


1 


1632 


SEQ ID NO: 7921 


0.09 % 


3607.31448 


2 


1 1640 


SEQ ID NO: 7922 


0.04 % 


1748.2560912 


3 


1776 


SEQ ID NO: 7923 


0.03% 


1492.58592 


4 


2512 


SEQ ID NO: 7924 


0.03 % j 


1434.16845 


5 


1073 


SEQ ID NO: 7925 


0.03 % 1 


1338.876 


6 


230 


SEQ ID NO: 7926 


0.01 % 1 


685.78272 


7 


1001 


SEQ ID NO: 7927 


0.01 % 1 


559.8936 


8 


716 


SEQ ID NO: 7928 


0.01 % 1 


558.27486 
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9 


2280 




1 SEQ ID NO: 7929 


1 0.01 % 


II 511.19781048 1 


10 


590 




1 SEQ ID NO: 7930 


1 0.01 % 


^1 469.6692 1 


11 


664 


jSEQ ID NO: 7931 


1 0.01% 


II 442.0763895241 


12 


1094 


1 SEQ ID NO: 7932 


1 0.00 % 


H 382.536 1 


13 


1735 


! SEQ ID NO: 7933 


1 0.00 % 


II 382.536 1 


14 


1625 


1 SEQ ID NO: 7934 


■ 

1 0.00 % 


11 342.4606344 


15 


1974 


!SEQ ID NO: 7935 


1 0.00 % 


II 336.885048 


16 


2382 


SEQ ID NO: 7936 


1 0.00 % 


II 319.9392 


17 


2417 


SEQ ID NO: 7937 


1 0.00 % 


II 319.9392 


18 


744 


SEQ ID NO: 7938 


1 0.00 % 


II 256.416670125 


19 


108 


SEQ ID NO: 7939 


! 0.00 % 


II 232.52724 


20 


390 


SEQ ID NO: 7940 


0.00 % 


II 228.0411084 










HLA A 0201 - 


10 mers 


• — \ 


iMaxImum possible score using this molecule type 


3925227.1 


|Rank||Start position 


Sequence 


^/o of max. score 


Score 


i 1 l| 


2511 


SEQ ID NO: 7941 


0.38 % 


15126.90795 


1 2 1 


1608 


SEQ ID NO: 7942 


0.05 % 


2049.4656 


3 II 


2572 


SEQ ID NO: 7943 


0.04 % 


1879.5921264 


4 II 


255 


SEQ ID NO: 7944 


0.03 % 


1566.6522795 


5 II 895 


SEQ ID NO: 7945 


0.03 % 


1338.876 


6 H 1171 


SEQ ID NO: 7946 


0.02 % 


1107.960876 


7 f 1691 


SEQ ID NO: 7947 


0.01 % 


782.95521024 


8 11 


20 


SEQ ID NO: 7948 


0.01 % 


549.9372312 


9 II 


1632 


SEQ ID NO: 7949 


0.01 % 


479.041993296 


10 II 


2280 


SEQ ID NO: 7950 


0.01 % 


472.418344576987 


11 1 


1963 


SEQ ID NO: 7951 


0.00 % 


358.73928 


12 1 


1955 


SEQ ID NO: 7952 


0.00 % 


331.093464 


13 II 


741 


SEQ ID NO: 7953 


0.00 % 


318.652488 


14 II 


523 


SEQ ID NO: 7954 


0.00 % 


278.7876 


15 1 


1073 


SEQ ID NO: 7955 


0.00 % 


266.6988828 


16 II 


2489 


SEQ ID NO: 7956 


0.00 % 


243.432 


17 1 


777 


SEQ ID NO: 795-7 


0.00 % 


218.5730664 


18 11 


1737 


SEQ ID NO: 7958 


0.00 % 


218.0785572 


19 II 


589 


SEQ ID NO: 7959 


0.00 % 


210.538251 


20 Ij 


229 


SEQ ID NO: 7960 


0.00 % 


205.230564 



HLA A 1101 - 9 mers 



llMaximum possible s^re using this molecule type 

-415- 



X3r 



BNStXXtD: <WO___2004092360A2J_> ■ 



wo 2004/092360 



PCT/US2004/011710 



Rank 


Start position 


Sequence 


*Vb of max. score 


Score 


1 


2337 1 


SEQ ID NO: 7961 


33.33 % 


12 


2 


2156 ! 


SEO ID NO: 7962 


25 % 


9 


3 


492 i 


SEQ ID NO: 7963 


20 % 


7.2 


4 


18 1 


SEQ ID NO: 7964 


16.66 % 


6 


5 


332 


SEQ ID NO: 7965 


16.66 % 


6 


6 


415 1 


SEQ ID NO: 7966 


16.66 % 


6 


7 


2479 j 


SEQ ID NO: 7967 


16.66 % 


- 6 


8 


1495 1 


SEQ ID NO: 7968 


11.11 % 


4 


9 


2035 i 


SEQ ID NO: 7969 


11.11% 


4 


10 


1349 


SEQ ID NO: 7970 


10 % 


3.6 


11 


1194 


SEQ ID NO: 7971 


8.33 % 


3 


12 


1648 


SEQ ID NO: 7972 


8.33 % 


3 


13 


96 


SEQ ID NO: 7973 


6.66 % 


2.4 


14 


764 


SEQ ID NO: 7974 


6.66 % 


2.4 


15 


986 


SEQ ID NO: 7975 


6.66 % 


2.4 


16 


2345 


SEQ ID NO: 7976 


6.66 % 


2.4 


17 


698 


SEQ ID NO: 7977 


5.55 % 


2 


18 


1355 


SEQ ID NO: 7978 


5.55 % 


2 


19 


1987 


SEQ ID NO: 7979 


5.55 % 


2 


20 II 2085 


SEQ ID NO: 7980 


5.55 % 


2 




HLA A llOl - 10 mers 


Maximum possible score using th'\s molecule type 


36 


Rank || Start position 


Sequence 


% of max. score 


Score 


1 1 2083 


SEQ ID NO: 7981 


33.33 % 


12 


2 II 2123 


SEQ ID NO: 7982 


25 % 


9 


3 II 2147 


SEQ ID NO: 7983 


16.66 % 


6 


4 II 331 


SEQ ID NO: 7984 


12.5 % 


4.5 


5 


1 1035 


SEQ ID NO: 7985 


11.11 % 


4 


6 


1 1064 


SEQ ID NO: 7986 


11.11 % 


4 


7 


1 2154 


SEQ ID NO: 7987 


11.11 % 


4 


8 


1 1048 


SEQ ID NO: 7988 


7.5 % 


2.7 


9 


1 202 


SEQ ID NO: 7989 


6.66 % 


2.4 


10 


1 721 


SEQ ID NO: 7990 


6.66 % 


2.4 


11 


1 2109 


SEQ ID NO: 7991 


6.66 % 


2.4 


12 


1 2230 


SEQ ID NO: 7992 


6.66 % 


2.4 


13 


1 1306 


SEQ ID NO: 7993 


5.55 % 


2 


14 


1 1622 


SEQ ID NO: 7994 


5.55 % 


2 



-416- 



BNSDOCID: <WO___2004<»236aA2_l_> 



wo 2004/092360 



PCT/US2004/011710 



15 


1772 


SEQ ID NO: 7995 


5.55 % 


2 


16 


1796 


SEQ ID NO: 7996 


5.55 % 


2 


17 


186 


SEQ ID NO: 7997 


5 % 


1.8 


18 


414 


SEQ ID NO: 7998 


5 % 


1.8 


19 


697 


SEQ ID NO: 7999 


5 % 


1.8 


20 


1175 


SEQ ID NO: 8000 


5 % 


1.8 




HLA B7 - 9 mers 


Maximu 


m possible score usinq this molecule type 


5400 


Rank 


Start position 


Sequence || °/o of max. score 


Score 


1 


1447 


SEQ ID NO: 8001 || 14.81 % 


800 


2 


642 


SEQ ID NO: 8002 !l 3.70 % 


200 


3 


34 


SEQ ID NO: 8003 || 2.22 % 


120 


4 


186 


SEQ ID NO: 8004 || 1.48 % 


I 80 


5 


244 


SEQ ID NO: 8005 11 1.48 % 


80 


6 


459 


SEQ ID NO: 8006 il 1.48 % 


80 


7 


1475 


SEQ ID NO: 8007 || 1.48 % 


80 


8 


1867 


SEQ ID NO: 8008 


1 1.48 % 


80 


9 


2032 


_SEQ ID NO: 8009 


1 1.48 % 


80 


10 
11 


2047 


SEQ ID NO: 8010 


1 1.48 % 


80 


2335 


SEQ ID NO: 8011 


1 1.48 % 


80 


12 


622 


SEQ ID NO: 8012 


1.11 % 


60 


13 


1375 


SEQ ID NO: 8013 


1.11 % 


60 


14 


1617 


SEQ ID NO: 8014 


0.92 % 


50 


15 


1023 


SEQ ID NO: 8015 


0.83 % 


45 


16 


286 


SEQ ID NO: 8016 


0.74 % 


: 40 


17 


490 


SEQ ID NO: 8017 


0.74 % 


40 


18 


810 


SEQ ID NO: 8018 


0.74 % 


40 


19 


1420 


SEQ ID NO: 8019 


0.74 % 


40 


20 


1854 


SEQ ID NO: 8020 


0.74 % 


40 






HLA B7 - 10 mers 


Maximum possible score usinq this molecule type 


5400 


Rank 

1 


Start position 


Sequence 


°/o of max. score 


Score 


1617 


SEQID NO: 8021 I 


3.70 % 


200 


2 


752 


SEQ ID NO: 8022 I 


2.22 % 


120 


3 


1552 


SEQ ID NO: 8023 I 


2.22 % 


120 


4 


154 


SEQID NO: 8024 I 


1.48 % 


80 


5 


165 It SEQID NO: 8025 


1.48 % 


80 



^17- 



BNSDOCID: <VWO__200409236aA2J_> 



wo 2004/092360 



PCT/US2004/011710 



6 


383 


SEO ID NO: 8026 


1.48 % 


1 80 


7 


1501 


SEQ ID NO: 8027 


1.48 % 


1 80 


8 


2093 


SEO ID NO: 8028 


1.48 % 


1 80 


9 


2564 


SEQ ID NO: 8029 


1.48 % 


1 80 


10 


622 


SEQ ID NO: 8030 


1.11 % 


1 60 


11 


1086 


SEQ ID NO: 8031 


1.11 % 


1 60 


12 


1262 


SEO ID NO: 8032 


1.11 % 


1 60 


13 


1556 


SEQ ID NO: 8033 


1.11 % 


1 60 


14 


845 


SEQ ID NO: 8034 


1 % 


1 54 


15 


286 


SEQ ID NO: 8035 


0.74 % 


1 40 


16 


490 


SEQ ID NO: 8036 


0.74 % 


1 40 


17 


552 


SEQ ID NO: 8037 


0.74 % 


1 40 


18 


1858 


SEQ ID NO: 8038 


0.74 % 


1 40 


19 


2107 


SEQ ID NO: 8039 


0.74 % 


1 40 


20 


2582 


SEQ ID NO: 8040 


0.74 % 


1 40 


Table 16: Epitope for SEQ ID NO: 6042 




HLAAl-9mers 


Maximum possible score using tliis molecule type 


5625 


Rank 


Start position 


Sequence 


°/o of max. score 


Score 


1 


846 


SEQ ID NO: 8041 


2.22 % 


125 


2 


798 


SEQ ID NO: 8042 


1.6 % 


90 


3 


787 


SEQ ID NO: 8043 


0.88 % 


50 


4 


1178 


SEQ ID NO: 8044 


0.88 % 


50 


5 


637 


SEQ ID NO: 8045 


0.8 % 


45 


6 


557 


SEQ ID NO: 8046 


0.44 % 


25 


7 


1020 


SEQ ID NO: 8047 


0.44 % 


25 


8 


282 


SEQ ID NO: 8048 


0.32 % 


18 


9 


1241 


SEQ ID NO: 8049 


0.24 % 


13.5 


10 


466 


SEQ ID NO: 8050 


0.22 % 


12.5 


11 


727 


SEQ ID NO: 8051 


0.2 % 


11.25 


12 


706 


SEQ ID NO: 8052 


0.17 % 


10 


13 


324 


SEQ ID NO: 8053 


0.16 % 


9 


14 


752 


SEQ ID NO: 8054 


0.16 % 


9 


15 


54 


SEQ ID NO: 8055 


0.13 % 


7.5 


16 


554 


SEQ ID NO: 8056 


0.13 % 


7.5 


17 


590 


SEQ ID NO: 8057 


0.12 % 


1 6.75 



-418- 



wo 2004/092360 



PCT/US2004/011710 



18 


569 


SEO ID NO: 8058 


0.08 % 


5 1 


19 


613 


SEQ ID NO: 8059 


0.08 % 




20 


90 


SEQ ID NO: 8060 


0.08 % 


4.5 






1 HLA Al - 10 mers 


iMaximum possible score using tliis molecule type || qfipq 




1 Start position 


Sequence 


9/o of max. score 


1 Score 


1 


1241 


SEQ ID NO: 8061 


4.8 % 


1 270 


2 


967 


SEQ ID NO: 8062 


0.8 % II 45^ 


3 


1010 


SEQ ID NO: 8063 


0.48 % II ~27 


4 


426 


SEQ ID NO: 8064 


0.44 % II 25 


5 


809 


SEQ ID NO: 8065 


0.44 % II 25 


6 


1178 


SEQ ID NO: 8066 


0.44 % II 25 


7 


787 


SEQ ID NO: 8067 


0.22 % 1 


12.5 


8 


958 


SEQ ID NO: 8068 


0.22 % 1 


12.5 


9 


727 


SEQ ID NO: 8069 


0.2 % 1 


11.25 


10 1 


610 


SEQ ID NO: 8070 


0.17% II 10 


11 1 


12 


SEQ ID NO: 8071 


0.13 % II 7.5 


12 1 


1181 


SEQ ID NO: 8072 


0.12% II 6.75 


13 1 


373 


SEQ ID NO: 8073 


0.11 % II 6.25 


14 1 


602 


SEQ ID NO: 8074 


0.11 % 1 


6.25 


15 1 


20 


SEQ ID NO: 8075 


0.04 % 


2.5 


16- 1 


32 


SEQ ID NO: 8076 


0.04 % II 2.5 


i7n 


53 


SEQ ID NO: 8077 


0.04 % II 2.5 


18 


400 


SEQ ID NO: 8078 


0.04 % II 2.5 


19 


557 


SEQ ID NO: 8079 1 


0.04 % Jl 2.5 


20 1 


667 


SEQ ID NO: 8080 1 


0.04 % il 2^5^ 



Hi-A A3 - 9 mers 


Maximum possible score using this molecule type 


12150 


Ranic 


Start position 


Sequence 


% of max. score 


Score 


1 


768 1 


SEQ ID NO: 8081 I 


0.82 % 


100 


2 


808 


SEQ ID NO: 8082 1 


0.49 % 


60 


3 


85 


SEQ ID NO: 8083 j 


0.24 % 


30 


4 


663 


SEQ ID NO: 8084 I 


0.24 % 1 


30 


5 


1245 


SEQ ID NO: 8085 1 


0.14 % 1 


18 


6 


288 


SEQ ID NO: 8086 1 


0.09 % 1 


12 


7 


50 


SEQ ID NO: 8087 1 


0.08 % 1 




8 


320 


SEQ ID NO: 8088 1 


0.07 % 1 





-419- 



<WO__20(M092380A?_I_> 



wo 2004/092360 



PCTAJS2004/011710 



9 


402 


SEQ ID NO: 8089 


0.07 % 


9 


10 


798 


SEQ ID NO: 8090 


0.07 % 


9 


11 


902 


SEQ ID NO: 8091 


0.06 % 


8.1 


12 


364 


SEQ ID NO: 8092 


0.05 % 


6;75 


13 


297 


SEO ID NO: 8093 


0.04 % 


6 


14 


992 


SEQ ID NO: 8094 


0.04 % 


6 


15 


38 


SEQ ID NO: 8095 


0.03 % 


4.5 


16 


249 


SEQ ID NO: 8096 


0.03 % 


4.5 


17 


706 


SEQ ID NO: 8097 


0.03 % 


4.05 


18 


1204 


SEQ ID NO: 8098 


0.03 % 


4.05 


19 


1178 


SEQ ID NO: 8099 


0.03 % 


4 


20 


343 


SEQ ID NO: 8100 


0.02 % 


3.6 




HLA A3 - 10 mers I 


Maximum possible score using this molecule type 


12150l 


Rank j 


Start position 


Sequence 


9/6 of max. score 


Score I 


• 1 1 


255 


SEQ ID NO: 8101 


1.48 % 


180 1 


2 


180 


SEQ ID NO: 8102 


0.55 % 


67.5 1 


3 


768 


SEQ ID NO: 8103 


0.49 % 


60 1 


4 


1177 


SEQ ID NO: 8104 


0.49 % 


60 1 


5 


380 


SEQ ID NO: 8105 


0.24 % 


30 1 


6 


100 


SEQ ID NO: 8106 


0.18 % 


•22.5 1 




786 


SEQ ID NO: 8107 


0.16% 


1 20 i 


NH 


1217 


SEQ ID NO: 8108 


0.16 % 


1 20 1 


9 


207 


SEQ ID NO: 8109 


0.14 % 


1 18 1 


10 


1183 


SEQ ID NO: 8110 


0.14% 


1 18 1 


11 


38 


SEQ ID NO: 8111 


0.09 % II 12 


12 


52 


SEQ ID NO: 8112 


0.09 % II 12 1 


13 


8 


SEQ ID NO: 8113 


0.06 % II 8 1 


14 


! 679 


SEQ ID NO: 8114 


0.06 % 


8 1 


15 


1 73 


SEQ ID NO: 8115 


0.05 % 


6.75 


16 


i 1204 


SEQ ID NO: 8116 


0.05 % 


6.075 


17 


i 50 


SEQ ID NO: 8117 


0.04 % 


6 


18 


i 774 


SEQ ID NO: 8118 


0.04 % 


6 


19 


1 845 


SEQ ID NO: 8119 


0.04 % 


6 


20 


1 214 


SEQ ID NO: 8120 


0.04 % 


5.4 



HLA A24 - 9 mers 

iMaximum possible score using this molecule type f 1596.672" 

-420- 



BNSDOCID: <WO__a0040923eQA2J_> 



wo 2004/092360 



PCT/US2004/011710 



Rank 


Start position 


1 Sequence 


% of max. score 


Score 


1 


1118 


1 SEQ ID NO: 8121 


19.84 % 


316.8 


2 


51 


I SEQ ID NO: 8122 


18.78 % 


300 


3 


__ 

161 


SEQ ID NO: 8123 


18.78 % 


300 


4 


434 


SEQ ID NO: 8124 


18.78 % 


300 1 


5 


365 


SEQ ID NO: 8125 


13.77 % 


220 


6 


736 


SEQ ID NO: 8126 1 


12.52 % 


200 


7 


620 


SEQ ID NO: 8127 | 


7.51 % 


120 


8 


1068 II SEQ ID NO: 8128 1 


7.51 % 


120 


9 


817 1 


SEQ ID NO: 8129 | 


3.75 % 


60 


10 


336 II SEQ ID NO: 8130 1 


3.44 % 


55 


11 


687 \ 


SEQ ID NO: 8131 1 


3.13 % 


50 


12 


254 1 


LSEQ ID NO: 8132 I 


2.34 % 


37.5 


13 


627 


SEQ ID NO: 8133 1 


1.87 % 


30 


14 


950 1 


SEQ ID NO: 8134 1 


1.75 % 


28 


15 


28 1 


SEQ ID NO: 8135 I 


1.56 % 


' 25 1 


16 


408 1 


SEQ ID NO: 8136 1 


1.56 % 


25 


17 


159 1 


SEQ ID NO: 8137 I 


1.31 % 


21 


18 


1166 1 


SEQ ID NO: 8138 1! 


1.26 % 


20.16 


19 


45 1 


SEQ ID NO: 8139 1 


1.25 % 


20 


1 20 


185 1 


SEQ ID NO: 8140 II 


1.25 % 


20 



HLA A24 - 10 mers 


Maximum possible score using tliis molecule type 


1596.672 


Rank 


Lstart position 


Sequence || o/o of max. score 


Score 


1 


1 438 


SEQ ID NO: 8141 11 


27.55 % 


440 


2 


1 489 


SEQ ID NO: 8142 II 


22.54 % 


360 


3 


1 254 


SEQ ID NO: 8143 II 


18.78 % 


300 


4 


1 354 


SEQ ID NO: 8144 || 


11.27 % 


180 


5 


1 406 


SEQ ID NO: 8145 II 


11.27 % 


180 


6 


i 1047 


SEQ ID NO: 8146 II 


11.27 % 


180 


7 


473 


SEQ ID NO: 8147 H 


7.51 % 


120 


8 


350 


SEQ ID NO: 8148 II 


6.26 % 


100 


9 


769 


SEQ ID NO: 8149 II 


6.26 % 


100 


_io 


193 


SEQ ID NO: 8150 II 


5.63 % 


90 


11 1 


479 


SEQ ID NO: 8151 II 


3.13 % 


50 


12 i 


0 


SEQ ID NO: 8152 II 


2.70 % 


43.2 


13 


813 


SEQ ID NO: 8153 II 


1.87 % 


30 


14 1 


739 


SEQ ID NO: 8154 II 


1.50 % 


24 



-421- 
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15 


782 


SEO ID NO: 8155 


1.50 % 


24 


16 


1186 


SEQ ID NO: 8156 


1.31 % 


21 


1 17 


910 


SEO ID NO: 8157 


1.05 % 


16.8 


18 


128 


SEO ID NO: 8158 


0.93 % 


15 


19 


183 


SEQ ID NO: 8159 


0.93 % 


15 


20 


1069 


SEQ ID NO: 8160 


0.93 % 


15 



HLA A 0201 - 9 mers 


Maximum possible score using this molecule type 


3925227.1 


RankI 


Start position 


Sequence 


*>/o of max. score 


Score 


1 1 


1041 


SEQ ID NO: 8161 


0.01 % 


484.2379773 


2 1 


981 


SEQ ID NO: 8162 


0.00 % 


382.536 


3 1 


957 


SEQ ID NO: 8163 


0.00 % 


342.4606344 




896 


SEQ ID NO: 8164 


0.00 % 


232.6931712 


5 ! 


1173 


SEQ ID NO: 8165 


0.00 % 


201.447432 


6 


733 


SEQ ID NO: 8166 


0.00 % 


171.86796 


7 


410 


SEQ ID NO: 8167 || 0.00 % 


135.45252 


8 


786 


SEQ ID NO: 8168 || 0.00 % 


119.463012 


9 


150 


SEQ ID NO: 8169 || 0.00 % 


102.17550222 


10 


1 


SEQ ID NO: 8170 || 0.00 % 


94.98737754 


11 


595 


SEQ ID NO: 8171 || 0.00 % 


93.239424 


12 


1095 


SEQ ID NO: 8172 || 0.00 % 


89.41779 


13 


1 1166 


SEQ ID NO: 8173 || 0.00 % 


87.58584 


14 


1 845 


SEQ ID NO: 8174 !| 0.00 % 


79.642008 


15 


1 734 


SEQ ID NO: 8175 j| 0.00 % 


73.47672 


16 II 802 


SEQ ID NO: 8176 | 0.00 % 


71.872056 


17 > II 1213 


SEQ ID NO: 8177 II 0.00% 


71.872056 


18 II 105 


SEQ ID NO: 8178 || 0.00 % 


50.232 


19 II 939 


SEQ ID NO: 8179 || 0.00 % 


49.13352 


20 II 130 


SEQ ID NO: 8180 || 0.00 % 


48.732354 




HLA A 0201 - 10 mers 


Maximum possible score usinq this molecule type 


3925227.1 


Rank 


Start position 


Sequence 


0/0 of max. score 


Score 


1 


372 


SEQ ID NO: 8181 


0.04 % 


1896.33528 


2 


410 


SEQ ID NO: 8182 


0.02 % 


1134.00849744 


3 


162 


SEO ID NO: 8183 


0.01 % 


685.3897512 


4 


1076 


SEO ID NO: 8184 


0.01 % 


640.90320525 


5 


1196 


SEO ID NO: 8185 


0.01 % 


623.7426663721 



-422- 
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wo 2004/092360 
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6 


353 


SEQ ID NO: 8186 


0.01 % 


446.7384576 


7 


50 


SEQ ID NO: 8187 


0.00 % 


1 375.97824 


8 


733 


SEQ ID NO: 8188 


0.00 % 


271.863864 


9 


130 


SEQ ID NO: 8189 


0.00 % 


235.6873848 


10 


415 


SEQ ID NO: 8190 


0.00 % 


185.679 i 


11 


297 


SEQ ID NO: 8191 


0.00 % 


177.496704 1 


12 


1 


SEQ ID NO: 8192 


0.00 % 


152.42160582 1 


13 


56 


SEQ ID NO: 8193 


0.00 % 


110.013876 1 


14 


732 


SEQ ID NO: 8194 


0.00 % 


101.0988 1 


15 


6 


SEQ ID NO: 8195 


0.00 % 


98.26704 1 


16 


261 


SEQ ID NO: 8196 


0.00 % 


91.60164 1 


17 


1040 


SEQ ID NO: 8197 


0.00 % 


76.98537 1 


18 


928 


SEQ ID NO: 8198 


0.00 % 1 


71.2908 1 


19 


1188 


SEQ ID NO: 8199 


0.00 % i 


69.81282 . 1 


20 


1094 


SEQ ID NO: 8200 


0.00% II 52.5987 1 



HLA A 1101 - 9 mers 


Maximum possible score using this molecule type 


1 36 


Rank li Start position 


Sequence || 


°/o of max. score 


1 Score 


1 




1 402 


SEQ ID NO: 8201 II 


25 % 


1 9 


2 




1 902 


SEQ ID NO: 8202 11 


22.5 % 


1 8.1 


3 




1 288 


SEQ ID NO: 8203 II 


11.11 % 


1 4 






1 85 


SEQ ID NO: 8204 || 


6.66 % 


1 2.4 






1 706 " 


SEQ ID NO: 8205 || 


6.66 % 


2.4 


6 




1 456 


SEQ ID NO: 8206 II 


5.55 % 


2 


7 


II 920 


SEQ ID NO: 8207 H 


5.55 % 


2 


8 




1 535 


SEQ ID NO: 8208 || 


5 % 


1.8 


9_ 




I 364 


SEQ ID NO: 8209 II 


3.33 % 


1.2 


10 


II 438 


SEQ ID NO: 8210 II 


3.33 % 


1.2 


11 




798 


SEQ ID NO: 8211 || 


3.33 % 


1.2 


12 




808 


SEQ ID NO: 8212 || 


3.33 % 


1.2 


13 




937 


SEQ ID NO: 8213nr 


3.33 % 


1.2 


14 




956 


SEQ ID NO: 8214 II 


3.33 % 


1.2 


15 


II 557 


SEQ ID NO: 8215 || 


2.77 % 


1 


16 




1218 


SEQ ID NO: 8216 II 


2.77 % 


1 


17 


784 


SEQ ID NO: 8217 || 


2.5 % 


0.9 


18 


249 


SEQ ID NO: 8218 II 


2.22 % 


0.8 


19 1 


768 


SEQ ID NO: 8219 II 


2.22 % 


0.8 


1 20 1 


1178 


SEQ ID NO: 8220 II 


2.22 % 


0.8 



-423- 



<WO__2CI040923e0A2J_> 



wo 2004/092360 



PCT/US2004/011710 



HLA A 1101 - 10 mers 


Maximum possible score using this molecule type 


36 




Start position 


Sequence 


9/6 of max. score 


Score 


1 


38 


SEQ ID NO: 8221 


13.33 % 


4.8 


2 


807 


SEQ ID NO: 8222 


12.5 % 


4.5 


3 


100 


SEQ ID NO: 8223 


11.11 % 


4 


4 


380 


SEQ ID NO: 8224 


11.11 % 


- 4 


5 


767 


SEQ ID NO: 8225 


10 % 


3.6 


6 


533 


SEQ ID NO: 8226 


8.33 % 


3 1 


7 


967 


SEQ ID NO: 8227 


6.66 % 


2.4 1 


8 


919 


SEQJD NO: 8228 


5.55 % 


2 1 


9 


305 


SEQ ID NO: 8229 


5 % 


1.8 1 


10 


211 


SEQ ID NO: 8230 


3.33 % 


1.2 1 


11 


511 


SEQ ID NO: 8231 


3.33 % 


/ 1.2 1 


12 


1177 


SEQ ID NO: 8232 


3.33 % 


1.2 1 


13 


429 


SEQ ID NO: 8233 


2.77 % 


1 J 


. 14 


758 


SEQ ID NO: 8234 


2.77 % 


1 1 


15 


797 


SEQ ID NO: 8235 


2.5 % • 


0.9 1 


16 


255 


SEQ ID NO: 8236 


2.22 % 


0.8 1 


17 


986 


SEQ ID NO: 8237 


2.22 % 


0.8 1 


18 


1157 


SEQ ID NO: 8238 


2.22 % ; 


. 0.8 1 


19 


170 


SEQ ID NO: 8239 


1.66 % : 


. 0.6 1 


20 


893 


SEQ ID NO: 8240 


1.66 % 


0.6 1 




HLA B7 - 9 mers 


Maximum possible score using this molecule type 


5400 


Rank 


Start position 


Sequence 


9/o of max. score 


Score 


1 


200 


SEQ ID NO: 8241 


1.48 % 


80 


2 


1243 


SEQ ID NO: 8242 


1.48 % 


80 


3 


123 


SEQ ID NO: 8243 


0.74 % 


40 


4 


248 


SEQ ID NO: 8244 


0.66 % 


36 


5 


1036 


SEQ ID NO: 8245 


0.66 % 


36 


6 


494 


SEQ ID NO: 8246 


0.37 % 


20 


7 


495 


SEQ ID NO: 8247 


0.37 % 


20 


8 


523 


SEQ ID NO: 8248 


0.37 % 


20 


9 


842 


SEQ ID NO: 8249 


0.37 % 


20 


10 


932 


SEQ ID NO: 8250 


0.37 % 


20 


11 


274 


SEQ ID NO: 8251 


0.33 % 


18 



-424- 



BNSDOCID: <WO__2004092360A2_L> 



wo 2004/092360 



PCT/US2004/011710 



12 


588 


SEQ ID NO: 8252 


0.22 % 


12 


13 


656 


SEQ ID NO: 8253 


0.22 % 


12 


14 


657 


SEQ ID NO: 8254 


0.22 % 


12 


15 


767 


SEQ ID NO: 8255 


0.22 % 


12 


16 


911 


SEQ ID NO: 8256 


0.22 % 


12 


17 


939 


SEQ ID NO: 8257 


0.22 % 


12 


18 


1007 


SEQ ID NO: 8258 


0.22 % 


12 


19 


1170 


SEQ ID NO: 8259 


0.22 % 


- 12 


20 


1206 


SEQ ID NO: 8260 


0.22 % 


12 






HLA B7 - 10 mers 


Maximum possible score using tliis molecuie type 


5400 


Rank 


Start position 


1 Sequence 


9/6 of max. score 


Score 


1 


505 


1 SEQ ID NO: 8261 


4.44 % 


240 


2 


312 


1 SEQ ID NO: 8262 


3.70 % 


200 


3 


141 


1 SEQ ID NO: 8263 


1.11 % 


60 


4 


1006 


1 SEQ ID NO: 8264 


0.66 % 


36 


5 


411 


1 SEQ ID NO: 8265 


0.44 % 


24 


6 


122 


1 SEQ ID NO: 8266 


0.37 % 


20 


7 


134 


1 SEQ ID NO: 8267 


0.37 % 


20 


8 


184 


i SEQ ID NO: 8268 


0.37 % 


20 


9 


367 


1 SEQ ID NO: 8269 


0.37 % 


20 


10 


402 


1 SEQ ID NO: 8270 


0.37 % 


20 


11 


494 


1 SEQ ID NO: 8271 


0.37 % 


20 


12 


560 


1 SEQ ID NO: 8272 


0.37 % 


20 


13 


626 


SEQ ID NO: 8273 


0.37 % 


20 


14 


931 


SEQ ID NO: 8274 


0.37 % 


20 


15 


956 


SEQ ID NO: 8275 


0.37 % 


20 


16 


1117 


SEQ ID NO: 8276 || 0.37 % 


20 


17 


1169 


SEQ ID NO: 8277 


0.37 % 


20 


18 


1196 


SEQ ID NO: 8278 


0.37 % 


20 


19 


247 


SEQ ID NO: 8279 


0.22 % 


12 


20 


273 


SEQ ID NO: 8280 


0.22 % 


12 



Table 17: Epitopes for SEQ ID NO: 6043 



HLA Al - 9 mers 



llt^aximum possible score using this molecule type || 5625" 



BNSDOCID: <WOl__2004092360A2_I_> 



wo 2004/092360 



PCT/US2004/011710 



Rank 1 


Start position 


Sequence 


9/6 of max. score 


Score 1 


1 1 


168 


SEQ ID NO: 8281 


0.2 % 


11.25 1 


2 J 


212 


SEQ ID NO: 8282 


0.08 % 


4.5 1 


3 


223 


SEQ ID NO: 8283 


0.08 % 


4.5 1 


4 


104 


SEQ ID NO: 8284 


0.04 % 


2.5 i 


5 


170 


SEQ ID NO: 8285 


0.04 % 


2.5 1 


6 


99 


SEQ ID NO: 8286 


0.04 % 


2.25 1 


7 


188 


SEQ ID NO: 8287 


0.02 % 


-1.35 1 


8 


180 


SEQ ID NO: 8288 


0.02 % 


1.25 


9 


219 


SEQ ID NO: 8289 


0.02 % 


1.25 


10 


18 


SEQ ID NO: 8290 


0.01 % 


1 1 


11 


226 


SEQ ID NO: 8291 


0.01% 


1 


12 


98 


SEQ ID NO: 8292 


0.01 % 


0.625 


13 


151 


SEQ ID NO: 8293 


o.oi;% 


0.625 


14 


10 


SEQ ID NO: 8294 


0.01% 


0.6 


15 


13 


SEQ ID NO: 8295 


0.00% 


0.5 


16 


32 


SEQ ID NO: 8296 


0.00 % 


0.5 


17 


70 


SEQ ID NO: 8297 


0.00% 


0.5 


18 


78 


SEQ ID NO: 8298 


0.00% 


0.5 


19 


82 


SEQ ID NO: 8299 


0.00% 


0.5 


20 


145 


SEQ ID NO: 8300 


0.00% 


0.5 




HLA Al - 10 mers 


Maximum possible score using this molecule type || 5625 


Rank 


Start position 


Sequence 


9/6 of max. score li Score 


1 


99 


SEQ ID NO: 8301 


0.8 ;% II 45 


2 


223 


1 SEQ ID NO: 8302 


0.8% II 45 


3 


188 


1 SEQ ID NO: 8303 


0.48 % II 27 


4 


206 


1 SEQ ID NO: 8304 


0.2% II 11.25 


5 


253 


1 SEQ ID NO: 8305 


0.17% II 10 


6 


174 


1 SEQ ID NO: 8306 


0.13 % II 7.5 


7 


97 


1 SEQ ID NO: 8307 


0.04 % II 2.5 


8 


257 


1 SEQ ID NO: 8308 


0.04 % II 2.5 


9 


179 


1 SEQ ID NO: 8309 


0.04 % H 2.25 


10 


162 


1 SEQ ID NO: 8310 


0.02 % II 1.25 


11 


196 


1 SEQ ID NO: 8311 


0.02 % II 1.25 


12 


219 


1 SEQ ID NO: 8312 


0.02 % II 1.25 


13 


18 


1 SEQ ID NO: 8313 


0.01 % 11 1 


14 


246 


1 SEQ ID NO: 8314 


0.01 % 11 1 



-426- 



BNSDOCID: <WO__2004092360A2_L> 



wo 2004/092360 



PCT/US2004/011710 



— li — ""^t::^ 

15 II 38 


SEQ ID NO: 8315 


0.01 % ll 0.7^ 


16 II 33 


SEQ ID NO: 8316 


0.00 % 


1 0 5 1 


7^ li TTZ 

17 11 69 


SEQ ID NO: 8317 


0.00 % 


II 0.5 


18 II 81 


SEQ ID NO: 8318 


0.00 % 


1 0.5 


19 II 104 


SEQ ID NO: 8319 


0.00 % 


1 0.5 


20 II 116 


SEQ ID NO: 8320 


0.00 % 


1 0.5 




li HLA A3 - 9 mers 


Maximt 


m possible score usinq this molecule type 


12150 


Jlank 


Start position || Sequence II o/^ of max. srnrp^ 


120 


1 


104 


SEQ ID NO: 8321 H 0.98 % i 


2 


123 


SEQ ID NO: 8322 II 0.74 % 


90 


3 


82 


SEQ ID NO: 8323 II 0.44 % 


54 


4 


106 


SEQ ID NO: 8324 || 0.11 % 


13.5 


5 


99 


SEQ ID NO: 8325 || 0.08% 


10.8 1 


6 


127 


SEQ ID NO: 8326 II 0.08 % 


10 


7 


71 


SEQ ID NO: 8327 || 0.07 % 


9 


8 


1 


SEQ ID NO: 8328 


0.06 % 


8.1 1 


9 


113 


' SEQ ID NO: 8329 


0.04 % 




10 


84 


SEQ ID NO: 8330 I 


0.03 % 




11 


109 


SEQ ID NO: 8331 


0.03 % 


4.05 1 


12 


58 


SEQ ID NO: 8332 || 0.02 % 


3 


13 


138 


SEQ ID NO: 8333 || 0.02 % 


3 1 


14 


44 


SEQ ID NO: 8334 || 0.02 % 


2.7 1 


15 


81 


SEQ ID NO: 8335 II 0.02% 


2.7 1 


16 


226 


SEQ ID NO: 8336 || 0.02 % | 


2.7 1 


17 


184 


SEQ ID NO: 8337 II 0.01% 


1.8 1 


18 


102 


SEQ ID NO: 8338 II 0.01 o/„ 


1.215 


19 


39 


SEQ ID NO: 8339 


0.00 % 


1.2 


20 


234 


SEQ ID NO: 8340 I 


0.00 % 1 


0.9 1 



HLA A3 - 10 mers 


Maximt 


m possible score usinq this molecule tvoe 


12150 


Rank 


Start position 


Sequence 


°/o of max. score 


Score 


1 


99 


SEQ ID NO: 8341 


1.33 % 


162 


2 


81 


SEQ ID NO: 8342 


0.44 % 


54 


3 




SEQ ID NO: 8343 


0.24 % 


30 


4 


51 1 


SEQ ID NO: 8344 1 


0.16% 1 


20 


5 


122 


SEQ ID NO: 8345 


0.11% 1 


13.5 



-427- 



BNSDOCID: <WO__aX)4092360A2J_> 



wo 2004/092360 



PCTAJS2004/011710 



6 


1 


SEO ID NO: 8346 


0.07 % 


9 


7 


69 1 


SEQ ID NO: 8347 


0.04 % 


6 


8 


223 1 


SEO ID NO: 8348 


0.04 % 


5.4 


9 


84 j 


SEQ ID NO: 8349 


0.03 % 


4.5 


10 


63 1 


SEO ID NO: 8350 


0.02 % 


3.6 


11 


138 1 


SEQ ID NO: 8351 


0.02 % 


3 


12 


201 1 


SEQ ID NO: 8352 


1 0.01% 


1.8 


13 


44 1 


SEQ ID NO: 8353 


j 0.01% 


-1.35 


14 


83 


SEQ ID NO: 8354 


1 0.01% 


1.35 


15 


116 1 


SEQ ID NO: 8355 


1 0.00 % 


1.2 


16 


46 1 


SEQ ID NO: 8356 


1 0.00% 


0.9 


17 


183 


SEQ ID NO: 8357 


1 0.00 % 


0.81 


18 


57 


SEQ ID NO: 8358 


1 0.00 % 


0.6 


19 


93 


SEQ ID NO: 8359 ll 0.00 % 


0.6 


20 


113 


SEQ ID NO: 8360 II 0.00 % 


0.6 



HLA A24 - 9 mers 



Maximum possible score using this molecule type 


1596.672 1 


Rank 


Start position |i Sequence 


o/o of max. score 


Score I 


1 


198 


I! SEQ ID NO: 8361 


13.15 % 


210 1 


2 1 


105 


II SEQ ID NO: 8362 


9.39 % 


150 1 


3 1 


210 


II SEQ ID NO: 8363 


4.69 % 


75 1 


4 1 


75 


I SEQ ID NO: 8364 


3.15 % 


50.4 1 




85 


11 SEQ ID NO: 8365 


2.63 % 


42 1 


6 1 


205 


11 SEQ ID NO: 8366 


2.10 % 


r 33.6 i 




77 


II SEQ ID NO: 8367 


1.87 % 


1 30 


NH 


158 


11 SEQ ID NO: 8368 


0.65 % 


1 10.5 


9 i 


103 


II SEQ ID NO: 8369 


0.56 % 


1 9 1 


10 


227 


II SEQ ID NO: 8370 


0.55 % 


1 8.8704 


11 


32 


11 SEQ ID NO: 8371 


0.54 % 


1 8.64 


12 


74 


II SEQ ID NO: 8372 


0.50 % 


1 8 


13 


131 


11 SEQ ID NO: 8373 


0.50 % 


1 8 


14 


54 


11 SEQ ID NO: 8374 


0.46 % 


1 7.5 


15 


99 


11 SEQ ID NO: 8375 


0.45 % 


11 7.2 


16 


44 


II SEQ ID NO: 8376 


1 0.37 % 


II 6 


17 


62 


II SEQ ID NO: 8377 


1 0.37 % 


II 6 


18 


87 


II SEQ ID NO: 8378 


1 0.37 % 




19 


89 


II SEQ ID NO: 8379 


1 0.37 % 




20 


154 


II SEQ ID NO: 8380 


1 0.37 % 


J 6 



-428- 



BNSEXXID: <WO__200409236QA2J_3 



wo 2004/092360 



PCT/US2004/011710 




HLA A 0201 - 9 mers 



Maxim 


um possible score using this molecule type 


3925227.1 


Rank 


Start position 


Sequence 


°/o of max. score 


Score 


1 


138 


SEQ ID NO: 8401 


0.21 % 


8532.082944 


2 


106 


SEQ ID NO: 8402 


0.10 % 


1 3977.8497792 


1 3 


44 


SEQ ID NO: 8403 


0.03 % 


1 1243.078056 


1 4 


71 


SEQ ID NO: 8404 


0.00 % 


1 348.872832 


1 5 


234 


SEQ ID NO: 8405 


0.00 % 


1 243.432 


1 6 


51 


SEQ ID NO: 8406 


0.00 % 


1 130.26096 




109 


SEQ ID NO: 8407 


0.00 % 


1 91.182672 


1 8 


81 


SEQ ID NO: 8408 


0.00 % 


1 ,73.342584 


1 9 


88 


SEQ ID NO: 8409 


0.00 % 


70.386624 


10 


1 


SEQ ID NO: 8410 


0.00 % 


65.32728732 


11 


38 


SEQ ID NO: 8411 


0.00 % 


47.876409 



-429- 



BNSDOCID: <WO__2004092360A2J_> 



wo 2004/092360 



PCT/US2004/011710 



12 


76 


SEQ ID NO: 8412 


0.00 % 


36.8637882 


13 


46 


SEQ ID NO: 8413 


0.00 % 


30.889782 


14 


211 


SEQ ID NO: 8414 


0.00 % 


21.616753941 


15 


201 


SEQ ID NO: 8415 


0.00 % 


19.657134 


16 


102 


SEQ ID NO: 8416 


0.00 % 


18.4318941 


17 


199 


SEQ ID NO: 8417 


0.00 % 


16.496865 


18 


74 


SEQ ID NO: 8418 


0.00 % 


15.783256167 


19 


62 


SEQ ID NO: 8419 


0.00 % 


13.9968225 


20 


99 


SEQ ID NO: 8420 


0.00 % 


10.31851392 




HLA A 0201 - 10 mers 


Maximum possible score using this molecule type 


3925227.1 


Rank 


Start position 


Sequence 


% of max. score 


Score 


1 


78 


SEQ ID NO: 8421 


0.01 % 


556.494246 


2 


138 


SEQ ID NO: 8422 


0.01 % 


395.245972224 


3 


84 


SEQ ID NO: 8423 


0.00 % 


201.554244 


4 


71 


SEQ ID NO: 8424 


0.00 % 


143.65707264 


5 


44 


SEQ ID NO: 8425 


0.00 % 


132.54624 


6 


76 


SEQ ID NO: 8426 


0.00 % 


84.78671286 


7 


8 


SEQ ID NO: 8427 


0.00 % 


69.552 


8 


211 


SEQ ID NO: 8428 


0.00 % 


52.7237901 


9 


113 


SEQ ID NO: 8429 


0.00 % 


47.99088 


10 


61 


SEQ ID NO: 8430 


0.00 % 


37.4509575 


11 


93 


SEQ ID NO: 8431 


0.00 % 


31.24872 


12 


137 


SEQ ID NO: 8432 


0.00 % 


31.1384304 


13 


37 


SEQ ID NO: 8433 


0.00 % 


27.531 


14 


55 


SEQ ID NO: 8434 


0.00 % 


22.9153278 


15 


98 


SEQ ID NO: 8435 


0.00 % 


22.1063618985 


16 


108 


SEQ ID NO: 8436 


0.00 % 


21.55457052 


17 


63 


SEQ ID NO: 8437 


0.00 % 


21.3624 


18 


45 


SEQ ID NO: 8438 


0.00 % 


19.657134 


19 


200 


SEQ ID NO: 8439 


0.00 % 


19.657134 


20 


104 


SEQ ID NO: 8440 


0.00% II 13.87622016 



HLA A 1101 - 9 mers 


Maximum possible score using this molecule type 


36 


Ranic 


Start position |i Sequence || <Vb of max. score 


Score 


1 


58 II SEQ ID NO: 8441 || 5.55 % 


2 


2 


125 II SEQ ID NO: 8442 || 1.66 % 


0.6 



-430- 



BNSDOCID: <W O g 00409236Qft2 I > 



wo 2004/092360 



PCT/US2004/011710 



3 


226 


SEO ID NO: 8443 


1 1.66 % 


0.6 


4 


229 


SEO ID NO: 8444 


1 1.66 % 


0.6 






HLA A 1101 - 10 mers 


Maximum possible score using this molecule type 


36 


Rank 


Start {position 


Sequence 


9/6 of max. score 


Score 


1 


122 


SEO ID NO: 8445 


2.22 % 


0.8 


2 


228 


SEO ID NO: 8446 


2.22 % 


0.8 




HLA B7 - 9 mers 


Maximum possible score using this molecule type 


1 5400 


Rank 


Start position 


Sequence 




9/o of max. score 


Score 


1 


97 


SEQ ID NO: 8447 




0.66 % 


I 35 


2 


86 


SEQ ID NO: 8448 


h 


0.37 % 


1 20 


3 


37 


SEQ ID NO: 8449 




0.33 % • 


r 18 


4 


62 


SEQ ID NO: 8450 




0.33 % 


P 18 1 


5 


32 


SEQ ID NO: 8451 




0.22 % 


12 


6 


102 


SEQ ID NO: 8452 




0.22 % • 


! ; 12 


7 


227 


SEQ ID NO: 8453 




0.22 % 


12 


8 


53 


SEQ ID NO: 8454 




0.11 % 




9 


1 


SEQ ID NO: 8455 




0.07 % 


m 


10 


44 


SEQ ID NO: 8456 II 


0.07 % 


4 


11 


56 


SEQ ID NO: 8457 




0.07% 




12 


64 


SEQ ID NO: 8458 




0.07 % 




13 


74 


SEQ ID NO: 8459 




0.07 % 


4 


14 


76 


SEQ ID NO: 8460 || 


0.07 % 


4 


15 


87 


SEQ ID NO: 8461 ! 




0.07 % 


4 


16 


106 


SEQ ID NO: 8462 I 




0.07 % 


4 


17 II 131 


SEQ ID NO: 8463 I 




0.07 % i 


4 


18 


23 


SEQ ID NO: 8464 i 




0.03 % 1 


2 


19 


157 


SEQ ID NO: 8465 I 




0.03 % ! 


2 


20 i 


166 


SEQ ID NO: 8466 II 


0.03 % 1 


2 



HLA B7 - 10 mers 


Maximu 


m possible score using this molecule type 


5400 


Rank 


Start position 


Sequence 


o/o of max. score 


Score 


1 


119 


SEQ ID NO: 8467 


3.33 % 


180 


2 


264 


SEQ ID NO: 8468 


1.48 % 


80 


3 


98 


SEQ ID NO: 8469 


0.66 % 


36 



-431- 



BNSOOCID: <WO_2004092360A2J_> 



wo 2004/092360 



PCT/US2004/0 11710 



4 


27 


SEO ID NO: 8470 


0.37 % 


20 


5 


86 


SEO ID NO: 8471 


0.37 % 


20 


6 


31 


SEO ID NO: 8472 


0.22 % 


12 


7 


63 


SEO ID IMO: 8473 


0.22 % 


12 


8 


96 


SEO ID NO: 8474 


0.22 % 


12 


9 


101 


SEO ID NO: 8475 


0.22 % 


12 


10 


226 


SEO ID NO: 8476 


0.22 % 


12 1 


11 


157 


SEO ID NO: 8477 


0.14% 


- 8 ' 1 


12 


176 


SEO ID NO: 8478 


0.14 % 


8 1 


13 


238 


SEO ID NO: 8479 


0.14:% 


^ ! 


14 


36 


SEO ID NO: 8480 


0.11 % 


6 


15 


53 


SEO ID NO: 8481 


0.11 % 


6 


16 


61 


SEQ ID NO: 8482 


0.11 % 


6 1 


17 


3 


SEQ ID NO: 8483 


0.07% 


4 1 


18 


40 


SEQ ID NO: 8484 


0.07 % 


4 


19 


55 


SEQ ID NO: 8485 


0.07 % 


4 


20 


74 


SEQ ID NO: 8486 


0.07% 


4 


Table 18: Epitopes for SEQ ID NO: 6044 


HLA Al - 9 mers 


Maximum possible score using this molecule type 


5625 1 


Rank 


Start position 


Sequence 


^/o of max. score 


Score 


1 


69 


SEQ ID NO: 8487 


0.04;% 


2.5 1 


2 


89 


SEQ ID NO: 8488 


0.02 % 


Ts 1 


3 


141 


SEO ID NO: 8489 


0.01% 


1 


4 


113 


SEQ ID NO: 8490 


0.00 % 


0.5 




HLA Al - 10 mers 


Maximum possible score using this molecule type 


5625 


Rank 


Start position 


Sequence 


*Vo of max. score 


Score 


1 


21 1 


SEQ ID NO: 8491 


0.02 % 


1.5 


2 


88 1 


SEQ ID NO: 8492 


0.02 % 


1.5 


3 


8 1 


SEQ ID NO: 8493 


0.02 % 


1.25 


4 


31 1 


SEQ ID NO: 8494 


0.00 % 


0.5 


5 


112 1 


SEQ ID NO: 8495 


0.00 % 


0.5 



5 




HLA A3 - 9 mers 



-432- 



wo 2004/092360 



PCT/US2004/011710 



Maxlmt 


rm possible score using tills molecule type 


12150 


Rank 


Start position 


Sequence 


9/6 of max. score 


Score 


1 


60 


SEO ID NO: 8496 


1.23 % 


150 


2 


77 


SEQ ID NO: 8497 


1.11 % 


135 


3 


141 


SEO ID NO: 8498 


0.49 % 


60 


4 


95 


SEQ ID NO: 8499 


0.32 % 


40 


5 


128 


SEO ID NO: 8500 


0.08 % 


10 


6 


113 


SEQ ID NO: 8501 


0.04 % 


6 


7 


69 


SEQ ID NO: 8502 


0.01 % 


2 


8 


22 


SEQ ID NO: 8503 II 0.01 % 


1.8 


9 


42 


SEQ ID NO: 8504 || 0.01 % 


1.8 


10 


78 


SEQ ID NO: 8505 I 


0.00 % 


1.2 


11 


32 


SEQ ID NO: 8506 I 


0.00 % 


1 


12 


54 


SEQ ID NO: 8507 | 


0.00 % 


0.9 


13 


74 


SEQ ID NO: 8508 I 


0.00% 


0.9 


14 


28 


SEQ ID NO: 8509 || 0.00 % 


0.6 


15 


36 


SEQ ID NO: 8510 || 0.00 % 


0.6 


16 


48 


SEQ ID NO: 8511 | 


0.00 % 


0.6 


17 


118 


SEQ ID NO: 8512 I 


0.00 % 


0.6 


18 


4 


SEQ ID NO: 8513 i 


0.00 % 


0.5 




H LA A3 - 10 mers 


Maximu 


m possible score using this nnolecule type 


12150 


Rank 


Start position 


Sequence I 


9/6 of max. score 


Score 


1 


94 


SEQ ID NO: 8514 1 


0.49 % 


60 


2 


48 


SEQ ID NO: 8515 | 


0.16 % 1 


20 


3 


1 128 


SEQ ID NO: 8516 \ 


0.16% 1 


20 


4 


60 


SEQ ID NO: 8517 | 


0.12% 1 


15 


5 


127 


SEQ ID NO: 8518 1 


0.12% 1 


15 


6 


25 


SEQ ID NO: 8519 i 


0.04 % 1 


6 


7 


95 


SEQ ID NO: 8520 1 


0.04 % 1 


6 


8 


141 


SEQ ID NO: 8521 | 


0.04 % i 


6 


9 


41 


SEQ ID NO: 8522 f 


0.04 % i 


5.4 


10 


77 


SEQ ID NO: 8523 1 


0.04 % 1 


5.4 


11 


116 


SEQ ID NO: 8524 || 


0.04 % 1 


5.4 


12 


91 


SEQ ID NO: 8525 || 


0.03 % 1 


4 


13 


4 


SEQ ID NO: 8526 1 


0.01 % 1 


2 


14 


112 


SEQ ID NO: 8527 II 


0.01 % 1 


1.8 


15 


113 


SEQ ID NO: 8528 || 


0.01 % 1 


1.35 1 
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16 


12 


SEQ ID NO: 8529 


0.00 % 


1.2 


17 


31 


SEQ ID NO: 8530 


0.00 % 


1 


18 


32 


SEQ ID NO: 8531 


0.00 % 


1 


19 


15 


SEQ ID NO: 8532 


0.00 % 


0.9 


20 


27 


SEQ ID NO: 8533 


0.00% 


0.9 




HLA A24 - 9 mers 


Maximum possible score using this molecule ty 


pe 


1596.672 


Rank 


Start position 


Sequence | 


°/o of max. score 


Score 


1 


61 


SEQ ID NO: 8534 


14.46 % 


231 


2 


16 


SEQ ID NO: 8535 | 


3.13 % 


50 


3 


120 


SEQ ID NO: 8536 


1.87 % 


30 


4 


41 


SEQ ID NO: 8537 


0.60 % 


9.6 


5 


71 


SEQ ID NO: 8538 


0.45 % 


7.2 


6 


21 


SEQ ID NO: 8539 I 


0.37 % 


6 


7 


53 


SEQ ID NO: 8540 


0.37 % 


6 


8 


65 


SEQ ID NO: 8541 I 


0.37 % 


6 


9 


121 


SEQ ID NO: 8542 | 


0.37 % 


6 


10 


74 


SEQ ID NO: 8543 | 


0.36 % ■ 


5.76 


11 


20 


SEQ ID NO: 8544 | 


0.35% 


5.6 


12 


79 


SEQ ID NO: 8545 | 


0.35 % 


5.6 


13 


105 


SEQ ID NO: 8546 i 


0.33 % 


5.28 


14 


48 


SEQ ID NO: 8547 | 


0.30 % : 


4.8 


15 


88 


SEQ ID NO: 8548 I 


0.30 % 


4.8 


16 


106 


SEQ ID NO: 8549 | 


0.30 % 


4.8 


17 


37 


SEQ ID NO: 8550 | 


0.27 % 


4.4 


18 


70 


SEQ ID NO: 8551 I 


0.27 % 


4.4 


19 


18 


SEQ ID NO: 8552 I 


0.25 % 


4 


20 


57 


SEQ ID NO: 8553 1 


0.22 % 


3.6 



HLA A24 - 10 mers 


Maximum possible score using this molecule type 


1596.672 


Rank 


Start position 


Sequence 


9/6 of max. score 


Score 


1 


120 


SEQ ID NO: 8554 


1.87 % 


30 


2 


73 


SEQ ID NO: 8555 


0.54 % 


8.64 


3 


19 


SEQ ID NO: 8556 


0.52 % 


8.4 


4 


78 


SEQ ID NO: 8557 


0.52 % 


8.4 


5 


104 


SEQ ID NO: 8558 


0.49 % 


7.92 


6 


61 


SEQ ID NO: 8559 


0.46 % 


7.5 
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7 


47 


SEQ ID NO: 8560 


0.45 % 


1 7.2 


8 


36 


SEQ ID NO: 8561 


0.41 % 


1 6.6 


9 


52 


SEQ ID NO: 8562 


0.37 % 


1 ,6 


10 


64 


SEQ ID NO: 8563 


0.30 % 


1 4.8 


11 


70 


SEQ ID NO: 8564 


0.30 % 


i 4.8 


12 


105 


SEQ ID NO: 8565 


0.30 % 


! 4.8 


13 


123 


SEQ ID NO: 8566 


0.30 % 


4.8 1 


14 


69 


SEQ ID NO: 8567 


0.27 % 


4.4 1 


15 


20 


SEQ ID NO: 8568 


0.25 % 


4 1 


16 


66 


SEQ ID NO: 8569 


0.25 % 


4 1 


17 


83 


SEQ ID NO: 8570 


0.25 % 




18 


86 


SEQ ID NO: 8571 


0.25 % 




Ll9_.. 


101 


SEQ ID NO: 8572 


0.25 % 


:. 4 1 


L20 


119 


SEQ ID NO: 8573 


0.25 % 1 


' 4 1 







HLA A 0201 - 


9 mers 




Maximum possible score using this molecule type 


3925227.1 


Rank 


1 start position 


Sequence li 


9/o of max. score 


Score 


1 


62 


SEQ ID NO: 8574 11 


0.00 % 


136.1646 


2 


85 


SEQ ID NO: 8575 || 


0.00 % 


69.6969 


3 


47 


SEQ ID NO: 8576 || 


0.00 % 


60.153786 


4 


121 


SEQ ID NO: 8577 il 


0.00 % 


52.5182736 


5 


74 


SEQ ID NO: 8578 || 


0.00 % 


49.13352 




23 


SEQ ID NO: 8579 11 


0.00 % 


21.99582 


7 


78 


SEQ ID NO: 8580 || 


0.00 % 


19.42488 


8 


114 


SEQ ID NO: 8581 || 


0.00 % 


14.6900655 


9 




SEQ ID NO: 8582 !l 


0.00 % 


11.304684 






SEQ ID NO: 8583 || 


0.00 % 


8.4687081 




122 ! 


SEQ ID NO: 8584 || 


0.00 % 


6.0996 1 


12 


100 ! 


SEQ ID NO: 8585 || 


0.00 % 


5.382 


13 1 


105 1 


SEQ ID NO: 8586 || 


0.00 % 


4.981593 


14 1 


25 1 


SEQ ID NO: 8587 || 


0.00 % 


4.968 


15 1 


115 1 


SEQ ID NO: 8588 || 


0.00 % 


4.966482 


16 1 


24 


SEQ ID NO: 8589 11 


0.00 % 


4.4815221585 I 


17 1 


111 


SEQ ID NO: 8590 || 


0.00 % 


4.128201 1 


18 1 


94 


SEQ ID NO: 8591 || 


0.00 % 


3.67632 


ImJ 


34 


SEQ ID NO: 8592 || 


0.00 % 


3.47553 1 




12 


SEQ ID NO: 8593 II 


0.00 % 


3.30993 1 



-435- 



BNSDOCID: <WO___20040923eoA2J_> 



wo 2004/092360 



PCT/US2004/011710 



HLA A 0201 - 10 mers 


Maximum possible score usinq tiiis molecule type 


3925227.1 


Rank 


Start position 


Sequence li 9/6 of max. score 


Score 


1 


77 


SEQ ID NO: 8594 || ' 


0.00 % 


147.97188 


2 


62 


SEO ID NO: 8595 II 


0.00 % 


143.59176 


3 


113 


SEQ ID NO: 8596 || 


0.00 % 


106.83684 


4 


78 


SEQ ID NO: 8597 || 


0.00 % 


83.526984 


5 


86 


SEQ ID NO: 8598 || 


0.00% 


83.526984 


6 


74 


SEQ ID NO: 8599 II 


0.00 % 


69.552 


7 


121 


SEQ ID NO: 8600 || 


0.00 % 


61.06776 


8 


12 


SEQ ID NO: 8601 || 


0.00 % 


50.232 


9 


44 


SEQ ID NO: 8602 || 


0.00 % 


26.082 


10 


4 


SEQ ID NO: 8603 || 


0.00 % 


18.3816 


11 


0 


SEQ ID NO: 8604 !| 


0.00 % 


17.38386 


12 


72 


SEQ ID NO: 8605 |j 


0.00 % 


17.1396 


13 


22 


SEQ ID NO: 8606 jj 


0.00 % 


16.21914 


14 


122 


SEQ ID NO: 8607 || 


0.00 % 


14.02908 


15 


64 


SEQ ID NO: 8608 || 


0.00 % 


11.161854 


16 


46 


SEQ ID NO: 8609 || 


0.00 % 


10.34586 


17 


54 


SEQ ID NO: 8610 || 


0.00 % 


8.846145 


18 


47 


SEQ ID NO: 8611 || 


0.00 % 


7.575080337 


19 


131 


SEQ ID NO: 8612 || 


0.00 % 


7.452 1 


20 


114 


1 SEQ ID NO: 8613 II 


0.00 % 


6.735366 j 



HLA A 1101 - 9 mers 


Maximum possible score using this molecule type 


36 


Ranic 


Start position 


Sequence 


o/o of max. score 


Score 


1 


69 


SEQ ID NO: 8614 


5.55 % 


2 


2 


22 


SEQ ID NO: 8615 


5 % 1 


1.8 


3 


77 


SEQ ID NO: 8616 


5 % i 


1.8 


4 


141 


SEQ ID NO: 8617 


3.33% 1 


1.2 


5 


60 


SEQ ID NO: 8618 


2.22 % 1 


0.8 


6 


95 


SEQ ID NO: 8619 


2.22% 


0.8 


. 7 


36 


SEQ ID NO: 8620 


1.66% 


0.6 



HLA A 1101 - 10 mers 


Maximum possible score using this molecule type 


36 


Ranic 


Start position 


Sequence || 9/6 of max. score 


Score 


1 


41 


SEO ID NO: 8621 || 3.33 % 


1.2 
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2 


68 


SEQ ID NO: 8622 


3.33 % 




3 


94 


SEQ ID NO: 8623 


3.33 % 


1.2 


4 


31 


SEQ ID NO: 8624 


2.77 % 


1 


5 


127 


SEQ ID NO: 8625 


2.5 % 


0.9 




HLA B7 - 9 mers 


Maximum possible score using tliis molecule type 


5400 


Rank 


Start position 


Sequence 


^/o of nriaXd score 


Score 


1 


48 


SEQ ID NO: 8626 


0.74 % 


40 


2 


20 


SEQ ID NO: 8627 


0.37 °/o 


20 


3 


121 il SEQ ID NO: 8628 


0.33 % 


18 


4 


h-4f-H 


SEQ ID NO: 8629 


0.07 % 


4 


5 




SEQ ID NO: 8630 


0.07 % '! 


4 


6 


37 II SEQ ID NO: 8631 


0.07 % 


4 


7 


41 II SEQ ID NO: 8632 


0.07 % 


4 


8 


53 II SEQ ID NO: 8633 


0.07 % 


4 


9 


65 II SEQ ID NO: 8634 


0.07 % V 


4 


10 


70 


SEQ ID NO: 8635 


0.07 % 


4 


11 


71 


SEQ ID NO: 8636 


■ 0.07 % 


4 


12 


74 


SEQ ID NO: 8637 


0.07 % 


4 


13 


79 


SEQ ID NO: 8638 


0.07 % 


4 


14 


88 


SEQ ID NO: 8639 1 


0.07 % 


4 


15 


105 


SEQ ID NO: 8640 i 


0.07 % 


4 


16 


106 i 


SEQ ID NO: 8641 | 


0.07 % 


4 


17 


124 1 


SEQ ID NO: 8642 | 


0.07 % 


4 


18 


1 ,1 


SEQ ID NO: 8643 I 


0.03 % 


2 


19 


120 1 


_SEQ ID NO: 8644 j 


0.03 % 


1.8 


20 


11 1 


SEQ ID NO: 8645 


0.02 % 


1.2 



HLA B7 - 10 mers 


Maximum possible score using this molecule type 


5400 


Ranl^ II Start position 


Sequence 


o/o of max. score 


Score 


1 n 66 


SEQ ID NO: 8646 


1.48 % 


80 


2 II 123 


SEQ ID NO: 8647 


0.74 % 


1 40 


3 II 20 


SEQ ID NO: 8648 


0.37 % 


20 


4 II 64 


SEQ ID NO: 8649 


0.22 % 


12 




SEQ ID NO: 8650 


0.11 % 


6 


6 II 54 


SEQ ID NO: 8651 


0.09 % 


5 


7 li 19 


SEQ ID NO: 8652 


0.07 % 


4 
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8 


36 


SEO ID NO: 8653 


0.07 % 


4 


9 


47 


SEQ ID NO: 8654 


0.07 % 


4 


10 


52 


SEQ ID NO: 8655 


0.07 % 


4 


11 


69 


SEQ ID NO: 8656 


0.07 % 


4 


12 


70 


SEO ID NO: 8657 


0.07 % 


4 


13 


73 


SEQ ID NO: 8658 


0.07 % 


4 


14 


78 


SEQ ID NO: 8659 


0.07 % 


4 


15 


83 


SEQ ID NO: 8660 


0.07 % 


- 4 


16 


86 


SEO ID NO: 8661 


0.07 % 


4 


17 


101 


SEQ ID NO: 8662 


0.07 % 


4 


18 


104 


SEQ ID NO: 8663 


0.07 % 


4 


19 


105 


SEQ ID NO: 8664 


0.07 % 


4 


20 


15 


SEQ ID NO: 8665 


0.03 % 


2 


Table 19: Epitopes for SEQ ID NO: 6045 


II HLA Al - 9 mers 


llMaximum possible score using tliis molecule type 


5625 




Start position 


Sequence 


9/6 of max. score 


Score 


1 


4 


SEQ ID NO: 8666 


0.02 % 


1.35 


2 


66 


SEQ ID NO: 8667 


0.02 % 


1.35 


3 


33 


SEQ ID NO: 8668 


0.02% 


1.25 


4 


44 


SEQ ID NO: 8669 


0.01 % 


1 


5 


50 


SEQ ID NO: 8670 


0.01 % 


1 


6 


14 


SEQ ID NO: 8671 


0.01 % 


0.75 


7 


48 


SEQ ID NO: 8672 


0.01 % 


0.75 


8 


11 


SEQ ID NO: 8673 


0.00 % 


0.5 






HLA Al - 10 mers 


Maximum possible score using this molecule type 


5625 


Rank 


Start position 


Sequence 


°/o of max. score 


Score 


1 


4 


SEQ ID NO: 8674 


0.12 % 


6.75 


2 


66 


SEQ ID NO: 8675 


0.12 % 


6.75 


3 


10 


SEQ ID NO: 8676 


0.00 % 


0.5 


4 


28 


SEQ ID NO: 8677 


0.00 % 


0.5 


5 


32 


SEQ ID NO: 8678 


0.00 9/0 


0.5 


6 


47 


SEQ ID NO: 8679 


0.00 % 


0.5 



5 
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HLA A3 - 9 ,mers 


Maximum possible score usinq this molecule type 


12150 


Rank 


Start position 


1 Sequence 


1 °/o of max. score 


Score 


1 


17 


1 SEO ID NO: 8680 


1 0.24 % 


30 


2 ■ 1 


44 


1 SEO ID NO: 8681 


1 0.07 % 


9 


3 1 


19 


1 SEO ID NO: 8682 


1 0.06 % 


8.1 


4 1 


50 


1 SEO ID NO: 8683 


1 0.04 % 


5.4 


5 1 


29 


1 SEO ID NO: 8684 


1 0.03 % 


"4 i 


6 1 


52 


SEO ID NO: 8685 


1 0.02 % 


3.24 


7 


54 


SEQ ID NO: 8686 


1 0.02 % 


3 1 


8 1 


11 


SEO ID NO: 8687 


0.01 % 


1.8 


9- 1 


37 


SEQ ID NO: 8688 


0.01 % 


1.8 1 


10 1 


25 


SEO ID NO: 8689 


0.01 % 


1.35 1 


11 1 


10 


SEQ ID NO: 8690 


0.00 % 1 


0.9 1 


12 1 


16 


SEQ ID NO: 8691 


0.00 % 


o.9n 


13 1 


35 


SEQ ID NO: 8692 


0.00 % 1 


0.6 



HLA A3 - 10 mers 



Maxlmi 


m possible score u 


sing tiiis molecule type 


12150 1 


Rank 


Start position 


Sequence 


% of max. score 


Score 


1 


49 


SEQ ID NO: 8693 


0.44 % 


54 n 


2 


17 


SEQ ID NO: 8694 


0.22 % 1 


27 1 


3 


10 


SEQ ID NO: 8695 


0.14% 1 


18 1 


4 


16 


SEQ ID NO: 8696 


0.07 % 1 


9 1 


5 


32 


SEQ ID NO: 8697 


0.04 % 1 


6 


6 


19 


SEQ ID NO: 8698 


0.01 % 1 


1.8 1 


7 


29 


SEQ ID NO: 8699 


0.00 % ! 


1.2 1 


8 


23 


SEQ ID NO: 8700 


0.00 % 1 


0.9 1 


9 


26 


SEQ ID NO: 8701 


0.00 % [j 0.9 1 



HLA A24 - 9 mers 


Maximi 


jm possible score usinq this molecule type 


1 1596.672 


Rank 


Start position 


Sequence 1 


9/o of max. score 


Score 


1 


18 


SEQ ID NO: 8702 1 


1.87 % 


30 


2 


24 


SEQ ID NO: 8703 1 


0.65 % 


10.5 


3 


9 


SEQ ID NO: 8704 I 


0.52 % 


8.4 


4 


12 


SEQ ID NO: 8705 1 


0.52 % 


8.4 


5 


28 


SEQ ID NO: 8706 1 


0.52 % 


8.4 


6 


42 


SEQ ID NO: 8707 I 


0.52 % 1 


8.4 
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7 II 57 


SEO ID NO: 8708 


0.52 % 


1 8.4 


8 II 66 


SEQ ID NO: 8709 


0.52 % 


1 8.4 


9 II 55 


SEQ ID NO: 8710 


0.51 % 


1 8.25 


10 II 0 


SEO ID NO: 8711 


0.48 % 


1 7.7 


^1 22 


SEO ID NO: 8712 


0.45 % II 7.2 


12 II 10 


SEQ ID NO: 8713 


0.37 % II 6 


13 


1 25 


SEO ID NO: 8714 


0.37 % II 6 


14 


1 30 


SEQ ID NO: 8715 


0.37 % II 6 


15 


1 19 


SEQ ID NO: 8716 


0.35 % 


i 5.6 


16 


1 40 


SEQ ID NO: 8717 


0.31 % 




17 


1 3 


SEQ ID NO: 8718 


0.30 % 


1 4.8 


18 


65 


SEQ ID NO: 8719 


0.30 % II 4.8 


19 


14 


SEQ ID NO: 8720 


0.27 % II 4.32 


20 


56 


SEQ ID NO: 8721 


0.25 % H 4 




HLA A24 - 10 mers 


Maximum possible score using this molecule type 


1596.672 


Rank 


1 Start position 


Sequence 


% of max. score 


Score 


1 


1 55 


SEQ ID NO: 8722 


18.78 % 


300 


2 


1 18 


SEQ ID NO: 8723 


2.63 % 


42 


3 


1 21 


SEQ ID NO: 8724 


2.25 % 


36 


4 


1 2 


SEQ ID NO: 8725 


1.87 % 


30 


5 


1 " 24 


SEQ ID NO: 8726 


1.87 % ■ 


30 


6 


1 11 


SEQ ID NO: 8727 


0.52 % 


8.4 




40 


SEQ ID NO: 8728 


0.52 % 


8.4 


8 


! 65 


SEQ ID NO: 8729 


0.42 % 


6.72 


9 


9 


SEQ ID NO: 8730 


0.37 % , 


6 


10 


i 8 


SEQ ID NO: 8731 


0.35 % 


5.6 


11 


1 27 


SEQ ID NO: 8732 


0.35 % 


5.6 


12 




SEQ ID NO: 8733 


0.35 % 


5.6 


13 




SEQ ID NO: 8734 


0.31 % 


5 


14 


17 


SEQ ID NO: 8735 


0.25 % 


4 


15 


29 


SEQ ID NO: 8736 


0.25 % 


4 


16 


64 


SEQ ID NO: 8737 


0.25 % 


4 


17 


16 


SEQ ID NO: 8738 


0.22 % 


3.6 


18 


10 


SEQ ID NO: 8739 


0.18 % 


3 


19 


13 


SEQ ID NO: 8740 


0.18 % 


2.88 


20 


23 


SEQ ID NO: 8741 


0.08 % 


1.4 
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HLA A 0201 - 9 mers 



Maximum possible score using this molecule type 


3925227.1 


Rank 


Start position 


Sequence 


9/6 of max. score 


Score 


1 


19 


^EQ ID NO: 8742 


0.03 % 


1310.8823136 


2 


15 


SEQ ID NO: 8743 


0.02 % 


1082.4143022 


3 


16 


SEQ ID NO: 8744 


0.02 % 


1040.33238624 


4 


49 


SEQ ID NO: 8745 


0.00 % 


382.536 


5 


25 


SEQ ID NO: 8746 


0.00 % 


342.863529264 


6 


56 


SEQ ID NO: 8747 


0.00 % 


63.28397376 


7 


12 


SEQ ID NO: 8748 


0.00 % 


40.19736105 


8 


10 


SEQ ID NO: 8749 


0.00 % 


21.3624 


9 


22 


SEQ ID NO: 8750 


0.00 % 


19.7762418 


10 


26 


SEQ ID NO: 8751 


0.00 % 


12.6684 


11 


20 


SEQ ID NO: 8752 


0.00 % 


11.544666 


12 


37 


SEQ ID NO: 8753 


0.00 % 


10.4328 


13 


32 


SEQ ID NO: 8754 


0.00 % 


8.4456 


14 


23 


SEQ ID NO: 8755 


0.00 % 


6.288.8049 


15 


47 


SEQ ID NO: 8756 


0.00 % 


6.0858 


16 


3 


SEQ ID NO: 8757 


0.00 % 


4.582929078 


17 


18 


SEQ ID NO: 8758 


0.00 % 


4.4855150505 


18 


28 


SEQ ID NO: 8759 


0.00 % 


4.2923589 


19 


62 


SEQ ID NO: 8760 


0.00 % 


2.88098391 


20 


27 


SEQ ID NO: 8761 


0.00 % 


1.699677 



HLA A 0201 - 10 mers 



Maximum possible score using this molecule type 


3925227.1 


Ranic 


Start position 


Sequence 


% of max. score 


Score 


1 


17 


SEQ ID NO: 8762 


0.16 % 


6459.14167272 


2 


19 


SEQ ID NO: 8763 


0.01 % 


607.88448 


3 


25 


SEQ ID NO: 8764 


0.00 % 


126.83304 


4 


11 


SEQ ID NO: 8765 


0.00 % 


63.16728165 


5 


15 


SEQ ID NO: 8766 


0.00 % 


53.54651988 


6 


37 


SEQ ID NO: 8767 


0.00 % 


28.51632 


7 


14 


SEQ ID NO: 8768 


0.00 % 


21.8247414 


8 


29 


SEQ ID NO: 8769 


0.00 % 


21.3624 


9 


26 


SEQ ID NO: 8770 


0.00 % 


19.42488 


10 


3 


SEQ ID NO: 8771 


0.00 % 


17.2167282 




48 


SEQ ID NO: 8772 


0.00 % 


15.7068219 




1 12 


SEQ ID NO: 8773 


0.00 % 


9.8581266 
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13 H 27 


SEQ ID NO: 8774 


0.00 % 


7.3086111 


14 II 39 


SEQ ID NO: 8775 


0.00 % 


7.10976 


15 II 23 


SEQ ID NO: 8776 


0.00 % 


5.7419523 


16 


22 


SEQ ID NO: 8777 


0.00 % 


4.599126 1 


17 


45 


SEQ ID NO: 8778 


0.00 % 


2.5495155 


18 


31 


SEQ ID NO: 8779 


0.00 % 


2.52747 


19 


52 


SEQ ID NO: 8780 


0.00 % 


2.383605 


20 


20 


SEQ ID NO: 8781 


0.00 % 


2.332847151 





HL.A A 1101 - 9 mers 


Maximum possible score usinq this molecule type 


1 36 


Rank jj Start position 


Sequence || 9/o of max. score 


1 Score 


1 « 44 


SEQ ID NO: 8782 II 3.33 % 


1 1.2 




HLA A iioi - 10 mers 


Maximum possible score using this hnolecule type || 


36 


Rank || Start position 


II Sequence |{ ^/o of max. score || 


Score 



HLA B7 - 9 mers 


Maximum possible score using this molecule type 


5400 


Rank 


Start position 


Sequence 


°/o of max. score 


Score 


1 


3 


SEQ ID NO: 8783 


0.37 % 


20 


2 


12 


SEQ ID NO: 8784 || 0.37 % 


20 


3 


22 


SEQ ID NO: 8785 


0.37 % 


20 


4 


56 


SEQ ID NO: 8786 


0.37 % 


20 


5 


30 


SEQ ID NO: 8787 


0.22% 


12 


6 


9 


SEQ ID NO: 8788 


0.07 % 


4 


7 


1 10 


SEQ ID NO: 8789 


0.07;% 


4 


8 


1 19 


SEQ ID NO: 8790 


0.07 % 


4 


9 


25 


SEQ ID NO: 8791 


0.07 % 


4 


10 


28 


SEQ ID NO: 8792 


0.07 % 


4 


11 


42 


SEQ ID NO: 8793 


0.07 % 


4 


12 


65 


SEQ ID NO: 8794 


0.07 % 


4 


13 


35 


SEQ ID NO: 8795 


0.05 % 


3 


14 


66 


SEQ ID NO: 8796 


0.02 % 


1.2 


15 


15 


SEQ ID NO: 8797 


0.01 % 


1 


16 


47 


SEQ ID NO: 8798 


0.01 % 


1 


17 


20 


SEQ ID NO: 8799 


0.01 % 


0.6 


18 


23 


SEQ ID NO: 8800 


0.00 % 


0.5 1 
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1 27 


1 SEQ ID NO: 8801 


1 0.00 % 


1 0.5 1 




HLA B7 - 10 mers 


Maximum possible score u 


sing tills molecule type 


5400 


Rank 


1 Start position 


Sequence 


% of max. score 


Score 


1 


1 27 


SEQ ID NO: 8802 


0.37 % 


20 


2 


1 8 


SEQ ID NO: 8803 


0.07 % 


'. 4 1 


3 


1 9 


SEQ ID NO: 8804 


0.07 % 


4 


4 


1 11 


SEQ ID NO: 8805 


0.07 % 


4 


5 


1 17 


SEQ ID NO: 8806 


0.07 % 


4 


6 


1 29 


SEQ ID NO: 8807 


0.07 % 


4 


7 


1 41 


SEQ ID NO: 8808 


0.07 % 


4 


8 


1 52 


SEQ ID NO: 8809 


0.07 % 


4 


9 


1 64 


SEQ ID NO: 8810 


0.07 % 


4 


10 


1 65 


SEQ ID NO: 8811 


0.07 % 


4 


11 


1 3 


SEQ ID NO: 8812 | 


0.03 % 


2 


12 


1 23 


SEQ ID NO: 8813 1 


0.03 % 


2 


13 


21 


SEQ ID NO: 8814 | 


0.02 % 


1.2 


14 


15 


SEQ ID NO: 8815 1 


0.01 % 


1 


15 


35 


SEQ ID NO: 8816 I 


0.01 % 


0.6 


16 


39 


SEQ ID NO: 8817 I 


0.01 % 


0.6 


17 


12 


SEQ ID NO: 8818 1 


0.00 % 


0.5 


18 


22 


SEQ ID NO: 8819 i 


0.00 % 


0.5 


19 


45 


SEQ ID NO: 8820 


0.00 % 


. 0.5 



Table 20: Epitopes for SEQ ID NO: 6046 



1 HLA Al - 9 mers 


iMaximum possible score using tliis molecule type 


5625 


Lma 


1 Start position II Sequence 


9/6 of max. score 


Score 


1 


186 II SEQ ID NO: 8821 


2.22 % 


125 


2 


156 II SEQ ID NO: 8822 


0.88 % 


50 


3 


14 II SEQ ID NO: 8823 


0.08 % 


4.5 


4 


0 II SEQ ID NO: 8824 


0.04 % 


2.5 


5 


29 II SEQ ID NO: 8825 


0.04 % 


2.5 


6 


85 II SEQ ID NO: 8826 


0.04 % 


2.5 1 


7 


168 II SEQ ID NO: 8827 


0.04 % 


2.5 1 


8 


133 II SEQ ID NO: 8828 


0.02 % 


1.35 1 
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9 


111 


SEQ ID NO: 8829 


0.02 % 


1.125 


10 


61 


SEQ ID NO: 8830 


0.01 % 


1 


11 


7 


SEQ ID NO: 8831 


0.01 % 


0.9 


12 


131 


SEQ ID NO: 8832 


0.01 % 


0.9 


13 


211 


SEQ ID NO: 8833 


0.01 % 1 


0.625 


14 


4 


SEQ ID NO: 8834 


0.00 % 


0.5 


15 


43 


SEO ID NO: 8835 


0.00 % 


0.5 


16 


95 


SEQ ID NO: 8836 


0.00 % 


0.5 


17 


136 


SEQ ID NO: 8837 


0.00 % 


0.5 




HLA Al - 10 mers 


Maximum possible score using this molecule type 


5625 


Rank | 


Start position 


Sequence || °/o of max. score 


Score 


i 1 


133 , 


SEQ ID NO: 8838 || 0.04 % 


2.7 


2 ! 


84 


SEQ ID NO: 8839 || 0.04 % 


2.5 


3 


167 


SEQ ID NO: 8840 || 0.04 % 


2.5 


4 


186 


SEQ ID NO: 8841 | 0.04 % 


2.5 


5 


131 


SEQ ID NO: 8842 || 0.04 % 


2.25 


6 


14 


SEQ ID NO: 8843 ll 0.03 % 


1.8 


7 


205 


SEQ ID NO: 8844 || 0.02 % 


1.25 


8 


i 111 


SEQ ID NO: 8845 || 0.02 % 


1.125 


9 


i 60 


SEQ ID NO: 8846 || 0.01 % 


1 


10 


1 188 


SEQ ID NO: 8847 i| 0.01 % 


0.75 


11 


1 211 


SEQ ID NO: 8848 II 0.01% 


0.625 


12 


1 26 


SEQ ID NO: 8849 \\ 0.00 % 


0.5 


13 


1 94 


SEQ ID NO: 8850 


1 0.00 % 


0.5 


14 


1 135 


1 SEQ ID NO: 8851 


0.00 % 


0.5 


15 


1 168 


1 SEQ ID NO: 8852 


1 0.00 % 


0.5 




HLA A3 - 9 mers 1 


Maximum possible score using this molecule type 


12150 1 


Rank 


Start position || Sequence 


9/6 of max. score 


Score 1 


1 


43 II SEQ ID NO: 8853 


0.24 % 


30 1 


2 


90 il SEQ ID NO: 8854 


0.14 % 


18 1 


3 


148 il SEQ ID NO: 8855 


0.09 % 


12 1 


4 


4 II SEQ ID NO: 8856 


0.05 % 


6.75 1 


5 


24 II SEQ ID NO: 8857 


0.04 % 


6 


6 


19 II SEQ ID NO: 8858 


0.04 % 


5.4 


7 


136 II SEQ ID NO: 8859 


0.04 % 


5.4 1 
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8 


54 


SEQ ID NO: 8860 


0.03 % II 4.5 


9 


32 


SEQ ID NO: 8861 


0.03 % II 4 


10 


14 


SEQ ID NO: 8862 


0.02 % II 3.6 


11 


59 


SEQ ID NO: 8863 


0.02 % II 3.6 


12 


88 


SEQ ID NO: 8864 


0.02 % II 3 


13 


87 


SEQ ID NO: 8865 


0.02 % 11 2.7 


14 


29 


SEQ ID NO: 8866 


0.01 % II 1.8 


15 


48 


SEQ ID NO: 8867 


0.01 % II - 1.8 


16 


115 


SEQ ID NO: 8868 


0.01 % II 1.8 


17 


186 


SEQ ID NO: 8869 


0.01 % II 1.8 


18 


106 


SEQ ID NO: 8870 


0.01 % II 1.5 


19 


53 


SEQ ID NO: 8871 


0.01 % II 1.35 


20 


173 


SEQ ID NO: 8872 


0.00 % ir 1.2 




HLAA3-10mers 


Maximum possible score using th\s molecule type 


12150 


Rank 


Start position 


Sequence 


°/o of max. score 


Score 


1 


24 


SEQ ID NO: 8873 


0.22 % 


27 


2 


54 


SEQ ID NO: 8874 


0.18 % 


22.5 


3 


135 


SEQ ID NO: 8875 


0.08% 


10.8 


4 


51 


SEQ ID NO: 8876 


0.07% 


9 


5 


13 


SEQ ID NO: 8877 


0.06 % 


8.1 1 


6 


26 


SEQ ID NO: 8878 


0.04 % 


6 


7 


31 


SEQ ID NO: 8879 


0.04 % X 


6 


8 


90 


SEQ ID NO: 8880 


0.04 % 


6 


9 


43 


SEQ ID NO: 8881 


0.03 % i 


4.5 


10 


19 


SEQ ID NO: 8882 


0.03 % 


4.05 


11 


169 


SEQ ID NO: 8883 


0.02 % 


3 


12 


87 


SEQ ID NO: 8884 


0.02 % 


2.7 


13 


84 


SEQ ID NO: 8885 


0.01 % 


1.8 


14 


88 


SEQ ID NO: 8886 


0.01 % 


1.8 


15 


94 


SEQ ID NO: 8887 


0.01 % 


1.8 


16 


64 


SEQ ID NO: 8888 


0.00 % 1 


1.2 


17 


131 


SEQ ID NO: 8889 


0.00 % 1 


1.2 


18 


99 


SEQ ID NO: 8890 


0.00 % 1 


1 


19 


53 II SEQ ID NO: 8891 


0.00 % 1 


0.9 


20 


85 II SEQ ID NO: 8892 


0.00 % 1 


0.9 



ll HLA A24 - 9 mers 
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Maximum possible score usinq this molecule type 


1596.672 


Rank 




Start position^ 


Sequence 


9/o of max. score 


Score 


1 


1 


196 


SEQ ID NO: 8893 


27.55 % 


440 


2 


1 


44 


SEQ ID NO: 8894 


18.78 % 


300 


3 




36 


SEQ ID NO: 8895 


12.52 % 


200 


4 


1 


92 


SEQ ID NO: 8896 


12.52 % 


200 


5 


II 


109 


SEQ ID NO: 8897 


2.70 % 


43.2 


6 


II 


25 


SEQ ID NO: 8898 


1.87 % 


30 


7 


II 


93 


SEQ ID NO: 8899 


1.12 % 


18 


8 


II 


12 


1 SEQ ID NO: 8900 


0.75 % 


12 


9 


1 


123 


1 SEO ID NO: 8901 


0.70 % 


11.2 


10 


1 


7 


1 SEQ ID NO: 8902 


0.64 % 


10.368 


11 


II 


17 


1 SEQ ID NO: 8903 


0.52 % 


8.4 


12 


1 


139 


1 SEQ ID NO: 8904 


0.52 % 


8.4 


13 


1 


193 


i SEQ ID NO: 8905 


0.46 % 


7.5 


14 


II 


6 


i SEQ ID NO: 8906 


0.45 % 


7.2 


15 


+ 


19 11 SEQ ID NO: 8907 


0.45 % 


7.2 


16 


1 


110 II SEQ ID NO: 8908 


0.45 % 


7.2 


17 


1 


114- 


1 SEQ ID NO: 8909 


0.45 % 


7.2 


18 


1 


210 


1 SEQ ID NO: 8910 


0.45 % 


7.2 


19 




46 


1 SEQ ID NO: 8911 


0.42 % 


6.72 


20 


1 


52 


1 SEQ ID NO: 8912 


0.37 % 


6 



HLA A24 - 10 mers 



jvjaximum possible score using this molecule type 


1596.672 


Rank 


Start position 


Sequence 


% of max. score 


Score 


1 


92 


SEQ ID NO: 8913 


7.51 % 


120 


2 


42 


SEQ ID NO: 8914 


2.63 % 


42 


3 1 


109 


SEQ ID NO: 8915 


2.25 % 


36 


4 J 


23 


SEQ ID NO: 8916 


1.87 % 


30 


5 


34 


SEQ ID NO: 8917 


0.75 % 


12 


6 1 


6 


SEQ ID NO: 8918 


0.64 % 


10.368 


7 


45 


SEQ ID NO: 8919 


0.63 % 


10.08 


8 i 


196 


SEQ ID NO: 8920 


0.62 % 


10 


9 


44 


SEQ ID NO: 8921 


0.56 % 


9 


10 


40 


SEQ ID NO: 8922 


0.55 % 


8.8 


11 


62 


SEQ ID NO: 8923 


0.46 % 


7.5 


12 


193 


SEQ ID NO: 8924 


0.46 % 


7.5 


13 


18 


SEQ ID NO: 8925 


0.45 % 


7.2 
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14 


1 1 -3 


SEQ ID NO: 8926 


0.45 % 


1 7.2 


15 


56 


SEQ ID NO: 8927 


0.37 % 


1 6 


16 


176 


SEQ ID NO: 8928 


0.37 % 




17 


16 


SEQ ID NO: 8929 


n OA 




1 5.6 


18 i 


138 


SEQ ID NO: 8930 


0.35 % 


5.6 


19 1 


127 


SEQ ID NO: 8931 


0.33 % 


5.28 ] 


20 1 


36 


SEQ ID NO: 8932 


0.31 % 


5 



HLA A 0201 - 9 mers 1 


Maxim 


um possible scor 


e using tliis molecule type 


1 3925227.1 | 


Rank 


Start position 


Sequence || o/o of max. scorc> 


1 Score 1 




13 


SEQ ID NO: 8933 


1 0.04 P/o 


i — ^ 

1 1793.676528 1 


2 


87 


SEQ ID NO: 8934 


1 0.03 % 


1 1415.3832 1 


3 


24 


SEQ ID NO: 8935 


1 0.01 °7o 1 


1 618.0996816 


4 


19 


SEQ ID NO: 8936 


1 0.00 % ~ 


! 223.23708 1 


5 


12 


SEQ ID NO: 8937 


1 0.00 % 


1 210.36400875 1 


6 


51 


SEQ ID NO: 8938 


1 0.00 % 


1 198.30859992 1 


7 


53 


SEQ ID NO: 8939 


r 0.00 % 


1 194.477328 1 


8 


88 


SEQ ID NO: 8940 


1 0.00 % 


180.58536756 1 


9 


106 


SEQ ID NO: 8941 || 0.00 % 


169.74828 1 


10 


54 


SEQ ID NO: 8942 11 0.00 oa ~ 


70.09848 1 


11 


59 1 


SEQ ID NO: 8943 


0.00 % 


43.42032 1 


12 


94 11 SEO ID NO: 8944 


0.00 % 


41.792058 1 


13 
14 


20 II SEO ID NO: 8945 
63 II SEO ID NO: 8946! 


0.00 % 1 


37.46088108 1 


15 


22 II SEO ID NO: 8947 


0.00 % 1 
0.00 % 1 


35.73520902 
20.5916435109 1 


16 


47 1 


SEQ ID NO: 8948 


~ 0.00 % 1 


12.233222865 ! 


17 


66 1 


SEQ ID NO: 8949 


0.00 % 


12.2199 1 


18 


56 1 


SEQ ID NO: 8950 


0.00 % 


11.486706 1 


19 


67 1 


SEQ ID NO: 8951 


0.00 % 


6.416172 1 


20 


117 1 


SEQ ID NO: 8952 


0.00 % 


5.827464 1 



HLA A 0201 - 10 mers 


Maxim 


um possible scor 


e using this molecule type 


3925227.1 


Rank 


Start position 


Sequence 


°/o of max. score 


Score 


1 


43 


SEQ ID NO: 8953 


0.10 % 


3977.8497792 
836.2525104 


2 


24 


SEQ ID NO: 8954 


0.02 % 


3 


51 


SEQ ID NO: 8955 


0.02 % 


815.616432 


4 


49 


SEQ ID NO: 8956 


0.01 % 


660.3245145 
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5 


19 


SEQ ID NO: 8957 


0.00 % 


251.837856 


6 


59 


SEQ ID NO: 8958 


0.00 % 


159.9696 


7 


12 


SEQ ID NO: 8959 


0.00 % 


155.245377 1 


8 


45 


SEQ ID NO: 8960 


0.00 % 


141.1974531 1 


9 


21 


SEQ ID NO: 8961 


0.00 % 


117.22672269 


10 


53 


SEQ ID NO: 8962 


0.00 % 


84.55536 


11 


87 


SEQ ID NO: 8963 


0.00 % 


65.5671672 


12 


13 


SEQ ID NO: 8964 


0.00 % 


64.88888616 


13 


153 


SEQ ID NO: 8965 


0.00 % 


49.13352 


14 


178 


SEQ ID NO: 8966 


0.00 % 


26.082 


15 


18 


SEQ ID NO: 8967 


0.00 % 


24.802259691 


16 


116 


SEQ ID NO: 8968 


0.00 % 


21.5616168 


17 


65 


SEQ ID NO: 8969 


0.00 % 


20.77383 


18 


86 


SEQ ID NO: 8970 


0.00 % 


15.7068219 


19 


27 


SEQ ID NO: 8971 


0.00 % 


12.3159135 


20 


46 


SEQ ID NO: 8972 


0.00 % 


11.45624789925 





HLA A llOl - 9 mers 



Maximum possible score using tliis molecule type 


36 


Rank 


Start position 


Sequence 


^/o of max. score 


Score 


1 


4 


SEQ ID NO: 8973 


12.5 % 


4.5 


2 


136 


SEQ ID NO: 8974 


3.33 % 


1.2 


3 


156 


SEQ ID NO: 8975 


3.33 % 


1.2 


4 


140 


SEQ ID NO: 8976 


1.66 % 


0.6 



HLA A 1101 - 10 mers 


Maximum possible score using this molecule type 


36 


Rank 


Start position 


Sequence 


9/6 of max. score 


Score 


1 


169 


SEQ ID NO: 8977 


5.55 % 


2 


2 


94 


SEQ ID NO: 8978 


3.33 % 


1.2 




HLA B7 - 9 mers 


Maximum possible score using this molecule type 


5400 


Rank 


Start position 


Sequence 


o/o of max. score 


Score 


1 


146 


SEQ ID NO: 8979 


0.74 % 


40 


2 


154 


SEQ ID NO: 8980 


0.74 % 


40 


3 


80 


SEQ ID NO: 8981 


0.66 % 


36 


4 


139 


SEQ ID NO: 8982 


0.33 % 


18 


5 


83 


SEQ ID NO: 8983 


0.22% 


12 
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6 


209 


SEQ ID NO 


: 8984 II 


0.22 % 




12 1 


7 


7 


SEQ ID NO 


: 8985 II 


0.11 % 




6 J 


8 


3 


SEQ ID NO 


: 8986 II 


0.07 % 


1 


4 


9 


6 


SEQ ID NO 


: 8987 II 


0.07 % 


ii 


4 


10 


12 


SEQ ID NO 


: 8988 II 


0.07 % 


II 


4 


11 


19 


SEQ ID NO 


: 8989 II 


0.07 % 


II 


4 


12 


24 


SEQ ID NO 


: 8990 II 


0.07 % 


1 


4 


13 


38 


SEQ ID NO 


: 8991 II 


0.07 % 


1- 


4 1 


14 


46 


SEQ ID NO 


8992 II 


0.07 % 


II 


4 1 


15 


56 


SEQ ID NO 


8993 II 


0.07 % 


1 


"~4 i 


16 


110 


SEQ ID NO 


8994 II 


0.07 % 


I 


4 1 


17 


114 


SEQ ID NO 


8995 II 


0.07 % 


1 


4 1 


18 


123 


SEQ ID NO 


8996 II 


0.07 % 




4 1 


19 


129 


SEQ ID NO 


8997 II 


0.07 % 


II 


4 1 


20 


166 


SEQ ID NO 


8998 II 


0.07 % 




4 1 








HLA B7 - 


10 mers 
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Table 21: Epitopes for SEQ ID NO: 6047 


HLA Al - 9 mers 


Maximum possible score usinq tliis molecule type 


5625 


Rank 


Start position 


Sequence 


9/o of max. score 


Score 


1 


53 


SEQ ID NO: 9019 


2 % 


112.5 


2 


10 


SEQ ID NO: 9020 


0.08 % 


-4.5 


3 


33 


SEQ ID NO: 9021 


0.02 % 


1.5 


4 


3 


SEO ID NO: 9022 


0.00 % 


0.5 


5 


27 


SEQ ID NO: 9023 


0.00 % 


0.5 


6 


29 


SEQ ID NO: 9024 


0.00 % 


0.5 




HLA Al " 10 mers 


■Maximum possible score using .this molecule type 


1 5625 


Rank 


Start position 


Sequence 


<Vb of max. score 


1 Score 


1 


10 


SEQ ID NO: 9025 


0.8% 


1 45 


2 


52 


SEQ ID NO: 9026 


0.2 % 


1 11.25 


3 


50 


SEQ ID NO: 9027 


0.04 % 


1 2.5 


4 


32 


SEQ ID NO: 9028 


0.02 % 


1 1.5 


5 


48 


SEQ ID NO: 9029 


0.02 % 


1.35 


6 


27 


SEQ ID NO: 9030 


0.00 % 


0.5 




HLA A3 - 9 mers 


Maximum possible score using this molecule type 


1 12150 


Rank 


Start position 


Sequence 


o/o of max. score 


1 Score 


1 


38 


SEQ ID NO: 9031 


1.85 % 


1 225 


2 


17 


SEQ ID NO: 9032 


0.02 % 


1 3.6 


3 


2 


SEQ ID NO: 9033 


0.02 % 


1 2.7 


4 


37 


SEQ ID NO: 9034 


0.01 % 


1 1.8 


5 


27 


SEQ ID NO: 9035 


0.01 % 


1 1.35 


6 


13 


SEQ ID NO: 9036 


0.00 % 


1 0.675 


7 


14 


SEQ ID NO: 9037 


0.00 % 


1 0.6 



HLA A3 - 10 mers 


Maximum possible score usinq this molecule type 


12150 


Rank li Start position 


Sequence 


o/o of max. score 


Score 


1 1 13 


SEQ ID NO: 9038 


0.04 % 


6 


2 1 37 


SEQ ID NO: 9039 


0.01 % 11 2.025 
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3 


2 


SEQ ID NO: 9040 


0.00 % 


0.9 


4 


19 


SEQ ID NO: 9041 


0.00 % 


0.675 




16 


SEQ ID NO: 9042 


0.00 % 


0.54 






1 HLA A24 - 9 mers 


jMaximum possible score using this molecule type 


1 1596.672 




Start position 


Sequence 


9/6 of max. score 


1 Score 


1 


20 


SEQ ID NO: 9043 


1.25 % 


1 20 


2 


6 


SEQ ID NO: 9044 


0.52 % 


1 8.4 


3 


5 


SEQ ID NO: 9045 


0.51 % 


1 8.25 


4 


35 


SEQ ID NO: 9046 


0.36 % 


1 5.76 


5 


31 • 


SEQ ID NO: 9047 


0.35 % 


1 5.6 


6 


43 


SEQ ID NO: 9048 


0.27 % 


4.4 


7 


13 


SEQ ID NO: 9049 


0.26 % 


1 4.2 


8 


32 


SEQ ID NO: 9050 


0.21 % 


t 3.36 


9 


2 


SEQ ID NO: 9051 


0.11 % 


1 1.8 


10 


9 


SEQ ID NO: 9052 


0.10 % 


[. 1.68 1 


11 


8 


SEQ ID NO: 9053 


0.09 % 


1 1.5 


12 


15 


SEQ ID NO: 9054 


0.09 % 


1.5 


13 


23 


SEQ ID NO: 9055 


0.09 % 


1.5 


14 


27 


SEQ ID NO: 9056 


0.08 % 


1.4 


15 


24 


SEQ ID NO: 9057 


0.07 % 


1.2 


16 


7 


SEQ ID NO: 9058 


0.06 % 


1 


17 


17 


SEQ ID NO: 9059 


0.06 % 


1 


18 


10 


SEQ ID NO: 9060 


0.05 % 


0.9 


19 


39 


SEQ ID NO: 9061 


0.04 % 


0.792 


20 


47 


SEQ ID NO: 9062 


0.04 % 


0.792 



HLA A24 - 10 mers 


Maximum possible score using tliis molecule type 


1 1596.672 


Rank 


Start position 


Sequence 


% of max. score 


1 Score 


1 


5 


SEQ ID NO: 9063 


2.63 % 


1 42 


2 


34 


SEQ ID NO: 9064 


0.54 % 


1 8.64 


3 


30 


SEQ ID NO: 9065 


0.52 % 


1 8.4 


4 


19 


SEQ ID NO: 9066 


0.50 % 


1 8 


5 


50 


SEQ ID NO: 9067 


0.33 % 


i 5.28 


6 


12 


SEQ ID NO: 9068 


0.26 % 


1 4.2 


7 


31 


SEQ ID NO: 9069 


0.21 % 


3.36 


8 


26 


SEQ ID NO: 9070 


0.15 % 


2.52 
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9 


8 


SEQ ID NO: 9071 


0.13 % 


2.1 


10 


22 


SEQ ID NO: 9072 


0.12 % 


2 


11 


23 


SEQ ID NO: 9073 


0.11 % 


1.8 


12 


6 


SEQ ID NO: 9074 


0.09 % 


1.5 


13 


14 


SEO ID NO: 9075 


0.09 % 


1.5 


14 


16 


SEQ ID NO: 9076 


0.09 % 


1.5 


15 


7 


SEQ ID NO: 9077 


0.06 % 


1 


16 


1 f 


SEQ ID NO: 9078 


0.04 % 


0;75 


17 


1 0 


SEQ ID NO: 9079 


0.04 % 


0.72 


18 


1 9 


SEQ ID NO: 9080 


0.04 % 


0.72 


19 


i 47 


SEQ ID NO: 9081 


0.04 % 


0.66 


20 


39 


SEQ ID NO: 9082 


0.03 % • 


0.6 



HLA A 0201 - 9 mers 


Maximum possible score using this molecule type 


1 3925227.1 


Rankli Start position 


Sequence 


% of max. score 


1 Score 


1 II 15 


SEQ ID NO: 9083 


0.00 % 


1 14.1442686 


2 11 27 


SEQ ID NO: 9084 


0.00 % 


1 9.598176 


3 


1 22 


SEQ ID NO: 9085 


0.00 % 


1 9.5634 


4 


1 9 


SEQ ID NO: 9086 


0.00 % 


1 5.546246013 


5 


1 2 


SEQ ID NO: 9087 


0.00 % 


1 5.526462816 


6 


1 24 


SEQ ID NO: 9088 


0.00 % 


i 4.88163753 


7 


L U_ 


SEQ IP NO: 9089 


0.00 % 


i 3.699285408 


8 


1 31 ■ 


SEQ ID NO: 9090 


0.00 % 


! 2.29699206 


9 


1 6 


SEQ ID NO: 9091 


0.00 % 


1 2.0016040674 


10 


1 ■ 7 


SEQ ID NO: 9092 


0.00 % 


1 0.91287 


11 


1 49 


SEQ ID NO: 9093 


0.00 % 


0.71805678 


NH 


16 


SEQ ID NO: 9094 


0.00 % 


0.6694257042 




12 


SEQ ID NO: 9095 


0.00 % 


0.6539828625 




HLA A 0201 - 10 mers 


l^aximum possible score using this molecule type 


3925227.1 


Rankli Start position 


Sequence 


% of max. score 


Score 


1 II 16 


SEQ ID NO: 9096 


0.00 % 


34.28765802 


2 


19 


SEQ ID NO: 9097 


0.00 % 


18.9368775 


3 1 


14 


SEQ ID NO: 9098 


0.00 % 


14.1442686 


4 1 


27 


SEQ ID NO: 9099 


0.00 % 


11.406528 


5 i 


26 


SEQ ID NO: 9100 


0.00 % 


10.9304361558 


6 1 


34 


SEQ ID NO: 9101 


0.00 % 


5.580927 
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7 


6 


SEQ ID NO: 9102 


0.00 % 


4.865742 


8 


9 


SEQ ID NO: 9103 


0.00 % 


2.64106953 


9 


50 


SEQ ID NO: 9104 


0.00 % 


2.6275752 1 


10 1 


30 


SEQ ID NO: 9105 


0.00 % 


2.29699206 1 


11 II 7 


QPO TPV MO- oirkc 

sc^^ iL> iMU. yiuo 


0.00 % 


0.86083641 I 


1 12 1 


42 


SEQ ID NO: 9107 


0.00 % 


0.7049592 1 


13 1 


22 


SEQ ID NO: 9108 


0.00 % 


0.6628440357 1 


14 1 


2 


SEQ ID NO: 9109 


0.00 % 


0.6530644656 



HLA A 1101 - 9 mers 


Maximi 


jm possible score u 


sing this molecule type 


36 


Rank 


Start position 


Sequence 


9/6 of max. score 


Score 


1 


37 


SEQ ID NO: 9110 


15% 


5.4 


.,2,„ 


38 


SEQ ID NO: 9111 


2.22 % 


0.8 






1 HLA A 1101 - 10 mers I 


iMaxImum possible score uslnq this molecule type 


36 




1 Start position 


Sequence 


°/o of max. score 


Score 

2.7 


1 1 II 37 


SEQ ID NO: 9112 


7.5 % 




II HLA B7 - 9 mers 


Maxlmt 


m possible score uslnq this molecule tvoe 


5400 


Rank 


Start position 


1 Sequence H o/o of max. score 


Score 


1 


35 


SEQ ID NO: 9113 J 3.70% 


200 


2 


17 


SEQ ID NO: 9114 II 0.11 % 


6 


3 


6 


SEQ ID NO: 9115 II 0.07% 


4 


4 


20 


SEQ ID NO: 9116 11 0.07% 


4 


5 


31 


SEQ ID NO: 9117 II 0.07% 


4 


6 


43 


SEQ ID NO: 9118 II 0.07 % 


4 


■7 


7 1 


SEQ ID NO: 9119 II 0.03% 


2 


8 


23 1 


SEQ ID NO: 9120 II 0.02 % 


1.2 


9 


24 1 


SEQ ID NO: 9121 II 0.02 % 


1.2 


10 


10 1 


SEQ ID NO: 9122 II 0.01 % 


0.9 




HLA B7 - 10 mers 


Maximu 


m possible score using this molecule tvoe 


5400 


Rank 


Start position 


Sequence 


9/6 of max. score 


Score 


1 


35 


SEQ ID NO: 9123 


0.09 % 


5 


2 


19 


SEQ ID NO: 9124 


0.07 % 


4 
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3 


30 


SEO ID NO: 9125 


0.07 % 


4 


4 


34 


SEO ID NO: 9126 


0.07 % 


4 


5 


7 


SEO ID NO: 9127 


0.03 % 


2 


6 


16 


SEO ID NO: 9128 


0.03 % 


1.8 


7 


23 


SEO ID NO: 9129 


0.02 % 


1.2 


8 


50 


SEO ID NO: 9130 


0.02 % 


1.2 


9 


9 


SEO ID NO: 9131 


0.01 % 


1 


Table 22: Epitopes for SEQ ID NO: 6048 


HLA Al - 9 mers I 


Maximum possible score using tills molecule type 


5625 1 


Rank 


Start position 


Sequence 


°/o of max. score 


Score J 


1 


66 


SEQ ID NO: 9132 


0.44 % 


25 1 


2 


80 


SEQ ID NO: 9133 


0.08 % 


5 1 


3 


93 


SEQ ID NO: 9134 


0.04 % 


2.7 


4 


11 


SEQ ID NO: 9135 


0.04 % 


2.5 


5 


89 


SEQ ID NO: 9136 


0.04 % 


2.25 


6 


48 


SEQ ID NO: 9137 


0.01 % 


1 


7 


3 


SEQ ID NO: 9138 


0.00 % 


0.5 


8 


9 


SEQ ID NO: 9139 


0.00 % 


0.5 


9 


56 


SEQ ID NO: 9140 


0.00 % 


0.5 


10 


101 


SEQ ID NO: 9141 


0.00% 


0.5 


11 


106 


SEQ ID NO: 9142 


0.00 % 


0.5 


12 


110 


SEQ ID NO: 9143 


0.00 % 


0.5 




HLA Al - 10 mers 


Maximum possible score using this molecule type 


5625 


Rank 


Start position 


Sequence 


9/6 of max. score 


Score 


1 


30 


SEQ ID NO: 9144 


0.4 % 


22.5 


2 


88 


SEQ ID NO: 9145 


0.12 % 


6.75 


3 


48 


SEQ ID NO: 9146 


0.04 % 


2.5 


4 


55 


SEQ ID NO: 9147 


0.02 % 


1.25 


5 


13 


SEQ ID NO: 9148 


0.01 % 


0.9 


6 


79 


SEQJD NO: 9149 


0.01 % 


0.75 


7 


93 


SEQ ID NO: 9150 


0.01 % 


0.675 


8 


2 


SEQ ID NO: 9151 


0.00 % 


0.5 


9 


8 


SEQ ID NO: 9152 


0.00 % 


0.5 
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10 


65 


SEQ ID NO: 9153 


0.00 % 


0.5 


11 


66 


SEQ ID NO: 9154 


0.00 % 


0.5 


12 


80 


SEQ ID NO: 9155 


0.00 % 


0.5 


13 


105 


SEQ ID NO: 9156 


0.00 % 


0.5 


14 


109 


SEQ ID NO: 9157 


0.00 % 


0.5 






1 HLA A3 - 9 mers 


iMaximum possible score usinq this molecule type 


1 12150 




1 Start position 


Sequence || Wo of max. score 


1 Score 


1 


109 


SEQ ID NO: 9158 || 0.74 % 


1 90 


2 


3 


SEQ ID NO: 9159 j 0.24 % 


30 


3 


111 


SEQ ID NO: 9160 !l 0.12 % 


rrr= 


4 


106 


SEQ ID NO: 9161 11 0.07 % 


1 9 


5 


95 


SEQ ID NO: 9162 !| 0.05 % 


1 6.075 


6 


101 


SEQ ID NO: 9163 || 0.04 % 


1 6 


7 


110 


SEQ ID NO: 9164 || 0.02 % 


1 3.6 


8 


84 


SEQ ID NO: 9165 || 0.02 % 


1 3 


9 


80 


SEQ ID NO: 9166 || 0.02 % 


1 2.7 


10 


37 


SEQ ID NO: 9167 || 0.01 % 


1 2.25 


11 


9 


SEQ ID NO: 9168 || 0.01 % 


1 2 


12 


54 


SEQ ID NO: 9169 II 0.01 % 


1 2 


13 


99 


SEQ ID NO: 9170 || 0.01 % 


1 1.35 


14 


1 


SEQ ID NO: 9171 || 0.01% 


1 1.215 


15 


11 


SEQ ID NO: 9172 il 0.00% 


1 0.9 


16 


15 


SEQ ID NO: 9173 


0.00 % 


1 0.9 


17 


69 


SEQ ID NO: 9174 


0.00 % 


1 0.6 


18 


5 


SEQ ID NO: 9175 


0.00 % 


1 0,54 


19 


103 


SEQ ID NO: 9176 


0.00 % 


1 0.54 




HLA A3 - 10 mers 


Maximu 


m possible score using tills molecule type 


12150 


Rank 


Start position 


Sequence 


% of max. score 


Score 


1 


75 


SEQ ID NO: 9177 


0.49 % 


60 


2 


109 


SEQ ID NO: 9178 


0.29 % 


36 


3 


22 


SEQ ID NO: 9179 


0.14 % 


18 


4 


15 


SEQ ID NO: 9180 


0.04 % 


6 


5 


110 


SEQ ID NO: 9181 


0.01 % 


2.25 


6 


95 


SEQ ID NO: 9182 


0.01 % 


1.8 


7 


101 II SEQ ID NO: 9183 


0.01 % 


1.35 
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8 


43 


SEO ID NO: 9184 


0.00 % 


1 


9 


2 


SEQ ID NO: 9185 


0.00 % 


0.9 


10 


5 


SEO ID NO: 9186 


0.00 % 


0.9 


11 


7 


SEO ID NO: 9187 


0.00 % 


0.9 


12 


107 


SEQ ID NO: 9188 


0.00 % 


0.9 


13 


102 


SEQ ID NO: 9189 


0.00 % 


0.81 


14 


3 


SEQ ID NO: 9190 


0.00 % 


0.75 


15 


8 


SEQ ID NO: 9191 


0.00% 


-0.6 


16 


103 


SEQ ID NO: 9192 


0.00 % 


0.54 




HLA A24 - 9 mers 


Maximum possible score using this molecule type 


1596.672 


Rank 


Start positiorL 


Sequence 


*Vb of max. score 


Score 


1 


88 


.SEQ ID NO: 9193 


1.66 % 


26.6112 


2 


77 


SEQ ID NO: 9194 


0.77 % 


12.32 


3 


18 


SEQ ID NO: 9195 




0.56 % 


9 


4 


108 


SEQ ID NO: 9196 




0.56 % 


9 


5 


92 


SEQ ID NO: 9197 


0.54 % 


8.64 


6 


96 


SEQ ID NO: 9198 


0.54 % 


8.64 


7 


73 


SEQ ID NO: 9199 


0.46 % 


7.5 


8 


40 


SEQ ID NO: 9200 


0.45 % 


7.2 


9 


104 


SEQ ID NO: 9201 


0.42 % 


6.72 


10 


8 


SEQ ID NO: 9202 


0.41 % 


6.6 


11 


21 


SEQ ID NO: 9203 


0.37 % 


6 


12 


102 


SEQ ID NO: 9204 


0.37 % 


6 


HN 


22 


SEQ ID NO: 9205 


0.25 % 


4 




68 


SEQ ID NO: 9206 


0.25 % 


4 


15 


106 


SEQ ID NO: 9207 


0.22 % 


3.6 


16 


1 


SEQ ID NO: 9208 


0.18 % 


3 




h-iH 


SEQ ID NO: 9209 


0.18 % 


3 






SEQ ID NO: 9210 


0.18 % 


3 




1 101 


SEQ ID NO: 9211 


0.18 % 


3 




37 


SEQ ID NO: 9212 


0.15 % 


2.4 



HLA A24 - 10 mers 


Maximum possible score using this molecule type 


1596.672 


Rank II Start position 


Sequence 


°/o of max. score 


Score 


1 II 100 


SEQ ID NO: 9213 


0.93 % 


15 


2 II 18 


SEQ ID NO: 9214 


0.78 % 


12.6 
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3 


98 


SEO ID NO: 9215 


0.52 % 


8.4 


4 


73 


SEO ID NO: 9216 


0.46 % 


7.5 


5 


91 


SEO ID NO: 9217 


0.45 % 


7.2 


6 


103 


SEQ ID NO: 9218 


0.42 % 


6.72 


7 


7 


SEQ ID NO: 9219 


0.41 % 


6.6 


8 


21 


SEQ ID NO: 9220 


0.37 % 


6 


9 


46 


SEQ ID NO: 9221 


0.37 % 


6 


10 


93 


SEQ ID NO: 9222 


0.37 % 


6 


11 


96 


SEQ ID NO: 9223 


0.37 % 


6 


12 


101 


SEQ ID NO: 9224 


0.37 % 


6 


13 


77 


SEQ ID NO: 9225 


0.25 % 


4 


14 


92 


SEQ ID NO: 9226 


0.22 % 


3.6 


15 


105 


SEQ ID NO: 9227 


0.22 % 


3.6 


16 


2 


SEQ ID NO: 9228 


0.18 % 


3 


17 


53 


SEQ ID NO: 9229 


0.18 % 


3 


18 


36 1 


SEQ ID NO: 9230 


0.12 % 


2 


19 


55 1 


SEQ ID NO: 9231 


0.12 % 


2 


20 


102 


SEQ ID NO: 9232 


0.11 % 


1.8 







HLA A 0201 - 


9 mers 




Maxim 


um possible score using this molecule type 


1 3925227.1 


Rank 


Start position 


Sequence 


*Vb of max. score 


1 Score 


1 


84 


SEQ ID NO: 9233 


0.01 % 


1 441.342216 


2 


102 


SEQ ID NO: 9234 


0.00 % 


1 63.16728165 


3 


107 


SEQ ID NO: 9235 


0.00 % 


1 51.882640425 


4 


1 


SEQ ID NO: 9236 


0.00 % 


1 43.8816609 


5 


95 


SEQ ID NO: 9237 


0.00 % 


1 33.40165248 


6 


2 


SEQ ID NO: 9238 


0.00 % 


1 24.66305226 


7 


92 


SEQ ID NO: 9239 


0.00 % 


I 22.64458905 


8 


103 


SEQ ID NO: 9240 I 


0.00 % 


1 20.70206586 


9 


47 


SEQ ID NO: 9241 | 


0.00 % 


11.175953184 


10 


94 


SEQ ID NO: 9242 i 


0.00 % 


8.452983 


11 


15 


SEQ ID NO: 9243 


0.00 % 


8.1793152 j 


12 


8 


SEQ ID NO: 9244 1 


0.00 % 


4.993461 1 


13 


5 


SEQ ID NO: 9245 


0.00 % 


4.57284528 I 


14 


99 


SEQ ID NO: 9246 1 


0.00 % 


3.999468528 I 


15 


105 


SEQ ID NO: 9247 1 


0.00 % 


2.231322 1 


16 


20 


SEQ ID NO: 9248 1 


0.00 % 


1.3524 1 


17 


62 


SEQ ID NO: 9249 1 


0.00 % 


0.8631693 1 
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18 


6 


SEO ID NO: 9250 


0.00 % 


0.824619 


19 


57 


SEO ID NO: 9251 


0.00 % 


0.72105 


20 


58 


SEO ID NO: 9252 


0.00 % 


0.7147572 1 




HLA A 0201 - 10 mers 


Maximum possible score using this molecule type 


3925227.1 


Rank 


Start position 


Sequence 


1 *>/o of max. score 


Score 


1 


101 


SEQ ID NO: 9253 


1 0.03 % 


1243.078056 


2 
3 


3 


SEQ ID NO: 9254 


1 0.01 % 


592.944462 


106 


SEQ ID NO: 9255 


0.00 % 


94.2678 


4 


5 


SEQ ID NO: 9256 


1 0.00 % 


43.42032 


5 


107 


SEQ ID NO: 9257 


1 0.00 % 


33.30332334 


6 


102 


SEQ ID NO: 9258 


1 0.00 % 


32.53181778 


7 


54 


SEQ ID NO: 9259 


1 0.00 % 


27.324 


8 


7 


SEQ ID NO: 9260 


1 0.00% 


21.3624 


9 


1 


SEQ lb NO: 9261 


0.00% 


13.723479 


10 


95 


SEQ ID NO: 9262 


0.00 % 


13.00344192 


11 


94 


SEQ ID NO: 9263 


0.00 % 


10.01276388 


12 


99 


SEQ ID NO: 9264 


0.00 % 


5.6615328 


13 


39 


SEQ ID NO: 9265 


0.00 % 


3.6304212 


14 


111 


SEQ ID NO: 9266 


0.00 % 


2.53368 


15 


103 


SEQ ID NO: 9267 


0.00 % 


2.475394803 


16 


14 


SEQ ID NO: 9268 


0.00 % 


2.4519012 


17 


19 


SEQ ID NO: 9269 


0.00 % 


2.07604992 


18 


29 


SEQ ID NO: 9270 


0.00 % 


1.8179154 


19 


57 


SEQ ID NO: 9271 


0.00 % 


1.52076 


20 


47 


SEQ ID NO: 9272 


0.00 % 


1 1.27712376 



HLA A 1101 - 9 mers 


Maximum possible score using this molecule type 


36 


Rank 


Start position 


Sequence 


o/o of max. score 


Score 


1 


80 


SEQ ID NO: 9273 


3.33 % 


1.2 


2 


69 


SEQ ID NO: 9274 


1.66 % 


0.6 


3 


109 


SEQ ID NO: 9275 


1.66 % 


0.6 



HLA A 1101 -- 10 mers 


Maximum possible score using this molecule type 


36 


Rank 


Start position || Sequence || 9/6 of max. score 


Score 




1 22 II SEQ ID NO: 9276 11 11.11 % 


4 
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11 HLAB7-9mers 


llMaximum possible score usinq this molecule type 


1 5400 




1 Start position 


Sequence 


9/o of max. score 


1 Score 


1 


1 22 


SEQ ID NO: 9277 


3.70 % 


200 


2 


1 77 


SEQ ID NO: 9278 


2.22 % 


1 120 


3 


1 104 


SEQ ID IMO: 9279 


0.22 % 


1 12 


4 


1 40 


SEQ ID NO: 9280 


0.11 % 


1 - • 6 ■• 


5 


1 8 


SEQ ID NO: 9281 


0.07 % 


4 


6 


1 21 


SEQ ID NO: 9282 


0.07 % 


4 


7 


68 


SEQ ID NO: 9283 


0.07 % 


4 


8 


92 


SEQ ID NO: 9284 


0.07 % 1 


4 


9 


102 


SEQ ID NO: 9285 


0.07 % 1 




10 


46 


SEQ ID NO: 9286 


0.03 % 1 






98 


SEQ ID NO: 9287 


0.03 % 1 




12 


103 




0.03 % 1 


2 


13 


88 


SEQ ID NO: 9289 


0.02% 1 


1.2 


14 


105 


SEQ ID NO: 9290 


0.01 % 1 


0.9 


15 


43 


SEQ ID NO: 9291 


0.01 % 1 


0.6 


16 


79 


SEQ ID NO: 9292 


0.01 % 1 


0.6 


17 


95 


SEQ ID NO: 9293 


0.01 % 1 


0.6 


18 


107 


SEQ ID NO: 9294 


0.00% 1 


0.5 



i HLA B7 - 10 mers 


[Maximu 


m possible score usInq this molecule type 


5400 


[ Rank 


Start position 


1 Sequence 


1 9/6 of max. score 


Score 


I 1 


46 


1 SEQ ID NO: 9295 


1 1.48 % 


80 




98 


1 SEQ ID NO: 9296 


1 1.48 % 


80 


1 3 


91 


1 SEQ ID NO: 9297 


0.37 % 


20 


1 4 


103 


i SEQ ID NO: 9298 


0.37 % 


20 1 


5 


7 


SEQ ID NO: 9299 


0.07 % 


4 


6 


21 


SEQ ID NO: 9300 


0.07 % 




7 


101 


SEQ ID NO: 9301 1 


0.07 % 


4 


8 


107 


SEQ ID NO: 9302 1 


0.03 % 


2 


9 


67 


SEQ ID NO: 9303 1 


0.02 % 


1.2 1 


10 


93 


_SEQ ID NO: 9304 I 


0.02 % 


1.2 1 


11 


69 1 


_ SEQ ID NO: 9305 1 


0.01 % 


1 1 


12 


39 i 


SEQ ID NO: 9306 1 


0.01 % 


0.6 1 


13 


77 1 


SEQ ID NO: 9307 1 


0.01 % 


0.6 1 



-459- 



BNSDCXJID: <WO__2004092360A2J_> 



wo 2004/092360 



PCT/US2004/011710 



i 14 II 22 il SEQIDNO:9308 II 0.00% || 0.5 j| 



Table 23: Epitopes for SEQ ID NO: 6049 



HLA Al - 9 mers 


Maximum possible score using tliis molecule type 


5625 


Rank 


Start position 


Sequence 


% of max. score 


Score 


1 


0 


SEO ID NO: 9309 


0.2 % 


11.25 


2 


35 


SEQ ID NO: 9310 


0.01 % 


0.9 


3 


4 


SEQ ID NO: 9311 


0.00 % 


0.5 


4 


5 


SEQ ID NO: 9312 


0.00 % 


. 0.5 


5 


10 


SEQ ID NO: 9313 


0.00 % 


0.5 


6 


19 


SEQ ID NO: 9314 


0.00 % 


0.5 


7 


21 


SEQ ID NO: 9315 


0.00% 


0.5 



5 



HLA Al - 10 mers 


Maximum possible score using tliis molecule type 


i 5625 


Ranic 


Start position 


Sequence 


9/o of max. score 


LScore 


1 


0 


SEQ ID NO: 9316 


0.2 % 


Lll.25 


Z 


5 


SEQ ID NO: 9317 


0.04 % 


1 2.5 


3 


33 


SEQ ID NO: 9318 


0.02 % 


1 1.5 


4 


3 


SEQ ID NO: 9319 


0.02 % 


1 1.25 


5 


9 


SEQ ID NO: 9320 


0.00 % 


1, 0.5 


6 


18 


SEQ^ID NO: 9321 


0.00 % 


1 0.5 


7 


20 


SEQ ID NO: 9322 


0.00 % 


1 0.5 




HLA A3 - 9 mers 


Maximum possible score using tiiis molecule type 


12150 


Rank || Start position 


Sequence 


°/o of max. score 


Score 


1 


4 


SEQ ID NO: 9323 


0.14 % 


18 


2 


16 


SEQ ID NO: 9324 


0.11 % 


13.5 


3 


23 


SEQ ID NO: 9325 


0.06 % 


8.1 


4 


18 


SEQ ID NO: 9326 


0.03 % 


4.05 


5 


21 


SEQ ID NO: 9327 


0.01 % 


2.025 


6 II 9 


SEQ ID NO: 9328 


0.01 % 


1.8 


7 11 15 


SEQ ID NO: 9329 


0.01 % 


1.8 


8 11 25 


SEQ ID NO: 9330 


0.01 % 


1.8 


9 n 12 


SEQ ID NO: 9331 


0.00 % 


0.9 



-460- 



BNSDOCID: <WO___2004O92360A2J_> 



wo 2004/092360 



PCT/DS2004/011710 



10 


19 


SEO ID NO: 9332 


0.00 % 


0.9 


11 


20 


SEO ID NO: 9333 


0.00 % 


0.9 


12 


2 


SEO ID NO: 9334 


0.00 % 


0.81 


13 


22 


SEO ID NO: 9335 


0.00 % 


0.81 


14 


10 


SEQ ID NO: 9336 


0.00 % 


0.6 






HLA A3 - 10 mers 


Maximum possible score using this molecule type 


12150 


Rank 


Start position 


Sequence 


^/o of maXi score 




1 


20 


SEQ ID NO: 9337 


1 0.16 % 


20.25 


2 


9 


SEQ ID NO: 9338 


1 0.09 % 


12 


3 


16 


SEQ ID NO: 9339 


0.07 % 


9 


4 


18 


SEO ID NO: 9340 


0.07 % 


9 


5 


22 


SEQ ID NO: 9341 


0.06 % 


8.1 


6 


4 


SEQ ID NO: 9342 


0.03 % 


4.05 


7 


15 


SEQ ID NO: 9343 


0.03 % 


4.05 


8 


12 


SEQ ID NO: 9344 


0.02 % 


3.6 


9 


3 


SEQ ID NO: 9345 


0.00 % 


0.9 


10 


33 


1 SEQ ID NO: 9346 


0.00 % 


0.6 


11 


2 


SEQ ID NO: 9347 


0.00 % 


0.54 


12 


24 


SEQ ID NO: 9348 


0.00 % 


0.54 




1 HLA A24 - 9 mers 


iMaximum possible score using this molecule tv 


pe 


1596.672 


1 Rank 


Start position 


Sequence 


<Vb of max. score 


Score 


1 1 


8 


SEQ ID NO: 9349 


18.78 % 


300 


1 2 


11 


SEQ ID NO: 9350 


1.87 % 


30 


\ 3 


[ 28 


SEQ ID NO: 9351 


1.50 % 


24 


1 4 


7 


SEQ ID NO: 9352 


0.75 % 


12 


1 5 


17 


SEQ ID NO: 9353 


0.56 % 


9 


1 6 


14 


SEQ ID NO: 9354 


0.46 % 


7.5 


1 7 


23 


SEQ ID NO: 9355 


0.37 % 


6 


1 8 


13 


SEQ ID NO: 9356 


0.36 % 


5.76 


1 9 


2 


SEQ ID NO: 9357 


0.35 % 


5.6 


1 10 


16 


SEQ ID NO: 9358 


0.35 % 


5.6 


1 11 


9 


SEQ ID NO: 9359 


0.30 % 


4.8 


1 12 1 


21 


SEQ ID NO: 9360 


0.26 % 


4.2 


1 13 1 


5 


SEQ ID NO: 9361 


0.25 % 


4 


Ll4 J 


4 


SEQ ID NO: 9362 


0.22 % 


3.6 



-461- 



BNSDCXJID: <WO ^200409236QA2_I_> 



wo 2004/092360 



PCT/US2004/011710 



15 


0 


SEO ID NO: 9363 


0.18 % 


3 


16 


19 


SEO ID NO: 9364 


0.18 % 


3 


17 


10 


SEO ID NO: 9365 


0.15 % 


2.4 


18 


18 


SEO ID NO: 9366 


0.13 % 


2.1 


19 


25 


SEO ID NO: 9367 


0.06 % 


1.1 


20 


15 


SEQ ID NO: 9368 


0.05 % 


0.9 




HLA A24 - lO mers 


Maximum possible score using this molecule type 


1 1 =>Qfi fi77 


Rank 


Start position 


SGQUcnce 


^/o of maXs score 




1 


8 


SEO ID NO- 9369 


22.54 % 


1 360 


2 


7 


SEQ ID NO: 9370 


1.25 % 




3 


17 


SEO TD NO" Q371 


0.65 % 


10.5 


4 


15 


SEO TD NO- Q377 


0.52 % 


8.4 


5 


4 


SEQ ID NO: 9373 


0.45 % 


[ 7 _ 2 


6 


22 


SEQ ID NO: 9374 


0.37 % 


1 ~6 


7 


12 


SEQ ID NO: 9375 


0.36 % 


i 5.76 J 


8 


27 


SEQ ID NO: 9376 


0.30 % 


I .4.8 ■ 1 


9 


14 


SEQ ID NO: 9377 


0.28 % 


1 4.5 1 


10 


20 


SEQ ID NO: 9378 


0.26 % 


4.2 1 


11 


10 


SEQ ID NO: 9379 


0.25 % 


4 1 


12 


3 


SEQ ID NO: 9380 


0.18 % 


3 


13 


18 


SEQ ID NO: 9381 


0.18 % 


3 


14 


9 


SEQ ID NO: 9382 


0.15% 


2.4 


15 


24 


SEQ ID NO: 9383 


0.10 % 


1.65 


16 


16 


SEQ ID NO: 9384 


0.07 % 


1.2 1 


17 


13 


SEQ ID NO: 9385 


0.06 % 


1 1 


18 


11 


SEQ ID NO: 9386 


0.05 % 


0.9 1 


19 


1 


SEQ ID NO: 9387 


0.05 % 


0.84 1 



HLA A 0201 - 9 mers 


Maximum possible score using tills molecule type 


3925227.1 


Rank 


Start position 


Sequence 


% of max. score 


Score 


1 


12 


SEQ ID NO: 9388 


0.10% 


4267.988928 


2 


23 


SEQ ID NO: 9389 


0.03 % 


1360.69088544 


3 


9 


SEQ ID NO: 9390 


0.01 % 


569.948832 


4 


16 


SEQ ID NO: 9391 


0.00 % 


309.0498408 


5 


15 


SEQ ID NO: 9392 


0.00 % 


79.73570448 


6 « 2 


SEQ ID NO: 9393 


0.00 % 


51.109542 



-462- 



BNSDOCID: <WO____20<M092360A2_L> 



wo 2004/092360 



PCT/US2004/011710 








HLA A 0201 


- 10 mers 




MaxiiT 


urn possible score using tliis molecule tvoe 


1 3925227.1 


R.3nk 


Start position 


i Sequence 


9/6 of max. score 


Score 


1 


22 


|SEQ ID NO: 9407 


0.09 % 


3636.068421648 1 


2 


4 


ISEQ ID NO: 9408" 


0.02 % 


1107.96-0876 


3 


15 


ISEQ ID NO: 9409 


0.02 % 


836.2525104 


4 


16 


1 SEQ ID NO: 9410 1 


0.00 % 


150.9313176 


5 


12 


ISEQ ID NO: 9411 


0.00 % 


76.55002416 


6 




ISEQ ID NO: 9412 


0.00 % 


49.0273014 


7 




ISEQ ID NO: 9413 


0.00 % 


42.1638414747 


8 


20 


SEQ ID NO: 9414 


0.00 % 


9.29480508 


9 


24 


SEQ ID NO: 94151 


0.00 % 


9.2669346 


10 


13 


SEQ ID NO: 94161 


0.00 % 


7.96581954 


11 


21 


SEQ ID NO: 94171 


0.00 % 


5.051306761875 


12 


5 1 


SEQ ID NO: 94181 


0.00 % 


2.6941464 1 


13 


11 i 


SEQ ID NO: 9419 


0.00 % 


2.3839914 1 


14 


34 
2 


SEQ ID NO: 9420 1 
SEQ ID NO: 9421 1 


0.00 % 1 


1.465422 1 


m 


9 


_^EQ ID NO: 94221 


0.00 % 
0.00 % 


0.70794 1 
0.6513048 1 




19 


SEQ ID NO: 9423 


0.00 % 


0.51882640425 1 



^_=_ HLA A 1101 -9 mers 

Maximum possible score using this molecule type 



Start position I j Sequence || °/o of max. s^5re 




-463- 



BNSDCWID: <WO__2004092360A2J_> 



wo 2004/092360 



PCT/US2004/011710 



HLA A 1101 - 10 mers 


Maximum possible score using tliis molecule type 


36 


Rank 


1 Start position 


Sequence 


% of max. score 


Score 


1 


1 33 


SEQ ID NO: 9424 


1.66 % 


0.6 






HLA B7 - 9 mers 


Maximum possible score using this molecule type 


5400 


Rank 


Start position 


1 Sequence 


9/6 of max. score 


Score 


1 


13 


1 SEQ ID NO: 9425 


0.22 % 


12 


2 


2 


1 SEQ ID NO: 9426 


0.07 % 


4 


3 


9 


1 SEO ID NO: 9427 


0.07 % 


4 


4 


16 


1 SEQ ID NO: 9428 


0.07 % 


4 


5 


23 


1 SEQ ID NO: 9429 


0.07% 


4 


6 


5 


1 SEQ ID NO: 9430 


0.02 % 


1.2 


7 


15 


1 SEO ID NO: 9431 


0.01 % 


1 






HLA B7 > 10 mers 


Maximum possible score using this molecule type 


54,00 


Rank 


Start position 


Sequence 


9/o of max. score 


Score 


1 


4 


SEQ ID NO: 9432 


0.07 % 


4 


2 


10 


SEQ ID NO: 9433 


0.07 % 


4 


3 


12 


1 SEQ ID NO: 9434 


0.07 % 


4 


4 


15 


SEQ ID NO: 9435 


0.07 % 


4 


5 


22 


SEQ ID NO: 9436 


0.07 % 


4 


6 


13 


SEQ ID NO: 9437 


0.02 % 


1.2 


Table 24: Epitopes for SEQ ID NO: 6050 




HLA Al - 9 mers 


Maximum possible score using this molecule type 


5625 


Rank 


Start position 


Sequence 


*>/o of max. score 


Score 


1 


47 


SEQ ID NO: 9438 


0.01 % 


0.75 


2 


21 


SEQ ID NO: 9439 


0.00 % 


0.5 


3 


53 


SEQ ID NO: 9440 


0.00 % 


0.5 




HLA Al - 10 mers 


Maximum possible score using this molecule type 


II 5625 


Rank 1 


Start position 


Sequence | 


°/o of max. score I 


Score 



-464- 



BNSCXICID: <WO___2004O82360A2J_a 



wo 2004/092360 



PCT/US2004/011710 



1 


16 


SEO ID NO: 9441 


0.04 % 


2.5 


2 


71 


SEO ID NO: 9442 


0.04 % 


2.5 


3 


47 


SEO ID NO: 9443 


0.02 % 


1.5 


4 


62 


SEQ ID NO: 9444 


0.01 % 


0.9 


5 


20 


SEQ ID NO: 9445 


0.00 % 


0.5 


6 


38 


SEQ ID NO: 9446 


0.00 % 


0.5 






L_ 




H LA A3 - 9 mers 




Maximum possible score usinq this molecule type 


12150 


Rank 


Start position 


Sequence 


9/6 of max. score 


Score 


1 


54 


SEQ ID NO: 9447 


0.02 % 


2.7 


2 


17 


SEQ ID NO: 9448 


0.01 % 


2 


3 


3 


SEQ ID NO: 9449 


0.01 % 


1.8 



HLA A3 - 10 mers 


Maximu 


m possible score usinq this molecule type 


12150 


Ranic 


Start position 


Sequence 


9/6 of max. score 


Score 


1 


! 22 


SEQ ID NO: 9450 


0.09 % 


12 




16 


SEQ ID NO: 9451 


0.01 % 


2 
0.9 




^ 54 


SEQ ID NO: 9452 


0.00 % 



HLA A24 - 9 mers 


Maximi 


jm possible score i 


jsing this molecule type 


1 1596.672 


Rank 


Start position 


Sequence 


9/o of max. score 


1 Score 


1 


70 


SEQ ID NO: 9453 


2.10 % 


1 33.6 


2 


7 


SEQ ID NO: 9454 


1.12 % 


1 18 


3 


60 


SEQ ID NO: 9455 


0.46 % 


1 7.5 


4 


54 i 


SEQ ID NO: 9456 


0.37 % 


1 6 


5 


14 , 


SEQ ID NO: 9457 


0.31 % 


1 5 


6 


19 


SEQ ID NO: 9458 


0.30 % 


1 4.8 


7 


47 


SEQ ID NO: 9459 


0.30 % 


4.8 


8 


12 1 


SEQ ID NO: 9460 


0.25 % 


4 


9 


15 1 


LSEQ ID NO: 9461 


0.25 % 




10 J 


67 1 


SEQ ID NO: 9462 


0.25 % 




11 1 


! 


SEQ ID NO: 9463 


0.18 % 


3 




37 1 


SEQ ID NO: 9464 


0.06 % 


1 


13 1 


1 


SEQ ID NO: 9465 


0.03 % 1 


0.5 



ll HLA A24 10 mers 



-465- 



BNSDCXJID: <WO__20040923eaA2_l_> 



wo 2004/092360 



PCT/US2004/011710 



Maximum possible score using tiiis molecule type 


1596.672 


Rank 


Start position 


Sequence 


^/o of max. score 


Score 


1 


14 


SEO ID NO: 9466 


12.52 % 


200 


2 


7 


SEQ ID NO: 9467 


0.93 % 


15 


3 


11 


SEO ID NO: 9468 


0.75 % 


12 


4 


60 


SEO ID NO: 9469 


0.56 % 


9 


5 


18 


SEQ ID NO: 9470 


0.45 % 


7.2 


6 


46 


SEQ ID NO: 9471 


0.45 % 


T.I 


7 


53 


SEO ID NO: 9472 


0.37 % 


6 


8 


69 


SEQ ID NO: 9473 || 0.35 % 


5.6 


9 


66 


SEQ ID NO: 9474 j 0.25 % 


4 


10 


20 


SEQ ID NO: 9475 


0.12 % 


2 


11 


47 


SEQ ID NO: 9476 


0.07 % 


1.2 


12 


36 


SEQ ID NO: 9477 


0.06 % 


1 


13 


26 


SEQ ID NO: 9478 


0.04 % 


0.75 


14 


70 


SEQ ID NO: 9479 


0.04 % 


0.72 






HLA A 0201 - 9 mers 


Maximum possible score using this molecule type 


3925227.1 


Rank 


Start position 


Sequence 


o/o Of max. score 


Score 


1 


54 


SEQ ID NO: 9480 


0.02 % 


881.199 


2 


26 


SEQ ID NO: 9481 


0.00 % 


95.013 


3 


61 


SEQ ID NO: 9482 


0.00 % 


93.69648 


4 


19 


SEQ ID NO: 9483 


0.00 % 


40.2894864 


5 


74 


SEQ ID NO: 9484 


0.00 % 


12.6684 


6 


35 


SEQ ID NO: 9485 


0.00 % 


10.34586 


7 


69 


SEQ ID NO: 9486 


0.00 % 


3.3704706 


8 


13 


SEQ ID NO: 9487 


0.00 % 


1.656 


9 


15 


SEQ ID NO: 9488 


0.00 % 


1.47537042 


10 


68 


SEQ ID NO: 9489 


0.00 % 


0.966 


11 


22 


SEQ ID NO: 9490 


0.00 % 


0.942678 


HH 


12 


SEQ ID NO: 9491 


0.00 % 


0.7669695 




36 


SEQ ID NO: 9492 


0.00 % 


0.52661835 




HLA A 0201 - 10 mers 


Maximum possible score using this molecule type 


3925227.1 


Rank 


Start position 


Sequence 


*Vb of max. score 


Score 


1 


61 


SEQ ID NO: 9493 


0.00 % 


93.69648 


2 


25 


SEQ ID NO: 9494 


0.00 % 


63.33035625 



-466- 



BNSDOCID: <WO__2004092360A2J_> 



wo 2004/092360 



PCT/US2004/011710 




-467- 



BNSDOCID: <WO___2004092360A2J_> 



wo 2004/092360 



PCTAIS2004/011710 



1 HLA B7 - 10 mers 


IMaximum possible score usina this molecule type 


5400 


1 tvariK 


1 Start position 


li Sequence 


1 o/o of max. score 


Score 


1 1 


1 69 


SEQ ID NO: 9522 


1 0.66 % 


36 


1 2 


1 53 


SEQ ID NO: 9523 


1 0.22 % 


1 12 


3 


1 5 


SEO ID NO: 9524 


1 0.13% 


1 7.5 1 


5 


1 66 
11 


SEQ ID NO: 9525 


1 n 1 1 OA 

1 U.J.X VP 


1 - 6 


6 


27 


SEQ ID NO: 9526 
SEQ ID NO: 9527 


1 0.07 % 
1 0.07 % 


1 4 

1 4 1 


7 


46 


SEQ ID NO: 9528 


1 0.07 % 


4 1 


8 


18 • 


SEQ ID NO: 9529 


1 0.02 % 


1.2 1 


9 


9 


SEQ ID NO: 9530 


0.01 % 


1 


10 
11 


26 
25 


SEQ ID NO: 9531 i 
SEQ ID NO: 9532 I 


0.01 % 


1 


12 


17 


SEQ ID NO: 9533 I 


0.01 % 
0.01 % 


0.75 1 
0.6 


13 


36 


SEQ ID NO: 9534 1 


0.01 % 


0.6 


14 


68 


SEQ ID NO: 9535 1 


0.01 % 


0.6 




35 


SEO ID NO: 9536 1 


0.00 % 


0.5 




42 


SEQ ID NO: 9537 || 


0.00 % 


0.5 




73 


SEO ID NO: 9538 II 


0.00 % 


0.5 


Table 25: Epitopes for SEQ ID NO: 6052 





1 HLA Al - 9 mers 


iMaxImi 


im possible score using this molecule type 




II 5625 


1 Rank 


Start position 


II Sequence 


9/6 of max. score 


II Score 


1 1 


365 


II SEQ ID NO: 9539 


0.8 % 




1 2 


397 


II SEQ ID NO: 9540 


0.44 % 


II 25 


[ 3 


L_ 229 


II SEQ ID NO: 9541 


0.32 % 


J 18 


4 


103 


II SEQ ID NO: 9542 


0.17 % 


II 10 


5 


338 


II SEQ ID NO: 9543 


0.17 % 


H 10 


6 


251 


II SEQ ID NO: 9544 


0.16 % 


"11 9 


7 


79 


II SEQ ID NO: 9545 


0.11 % 


II 6.25 


8 


119 


II SEQ ID NO: 9546 


0.10 % 


1 6 


9 


361 


II SEO ID NO: 9547 


0.08 % 


1l 5 


10 


60 


II SEQ ID NO: 9548 


0.04 % 


II 2.25 




101 


II SEQ ID NO: 9549 


0.04 % 


II 2.25 


1 12 1 


278 


II SEQ ID NO: 9550 


0.04 % 


II 2.25 



wo 2004/092360 



PCT/US2004/011710 



13 II 23 


SEO ID NO: 9551 II 0.02 % 


1.25 


14 


1 164 


SEQ ID IMO: 9552 || 0.02 % 


1.25 


15 


1 165 


SEO ID NO: 9553 II 0.02% 


1.25 


16 


1 295 


SEQ ID NO: 9554 || 0.02 % 


1.25 


__17 


1 172 


SEO ID NO: 9555 II 0.01 % 


0.9 


18 


i 0 


SEO ID NO: 9556 tl 0.01 % 


0.75 


19 


1 311 


SEQ ID NO: 9557 || 0.01 % 


0.75 


20 


1 78 


SEQ ID NO: 9558 || 0.01% 


0.625 






HLA Al - 10 mers 


Maximi 


m possible score usinq this molecule type 


rrcT c 1 

ooZo 1 


Rank 


Start position 


Sequence || 9/6 of max. score 


Score 1 


1 


114 


SEQ ID NO: 9559 || 1.11 o/^ 


62.5 1 


2 


134 


SEQ ID NO: 9560 II 0.8 % 


45 1 


3 


365 


SEO ID NO: 9561 II 0.8 % 


1 45 1 


4 


77 


SEQ ID NO: 9562 || 0.66 % 


1 o/.b 1 


5 


103 


SEQ ID NO: 9563 II 0.44 % 


i 91:; "1 
I 25 1 


6 


23 


SEQ ID NO: 9564 II 0.22 % 




7 


338 


SEQ ID NO: 9565 II 0.17 % H 


ID 1 


8 


361 


SEQ ID NO: 9566 II 0.17 % 


10 1 


9 


324 


SEQ ID NO: 9567 11 0.11 % i 


6.25 1 


10 


375 


SEQ ID NO: 9568 II 0.11 % | 


6.25 1 


11 


79 


SEQ ID NO: 9569 II 0.04 o/o | 


2.5 1 


12 


295 


SEQ ID NO: 9570 II 0.04 % ~\ 


2.5 1 


13 


346 


SEQ ID NO: 9571 || 0.04 % 1 


■ 2.5 1 


14 


378 


SEQ ID NO: 9572 


1 0.03 % 1 


- 2 1 


15 


251 


SEQ ID NO: 9573 


1 0.03 % 1 


1.8 1 


16 


214 


SEQ ID NO: 9574 


1 0.02 % 1 


1.125 1 


17 


160 


SEQ ID NO: 9575 


1 0.01 % 1 


i 1 


18 1 


172 1 


SEQ ID NO: 9576 


1 0.01 % 1 


0.9 


19 1 


229 1 


SEQ ID NO: 9577 


1 0.01 % 1 


0.9 1 


20 1 


376 1 


SEQ ID NO: 9578 


! 0.01 % II 


0.9 




HLA A3 - 9 mers 


Maximu 


m possible score usinq this molecule tvoe 


12150 


Rank 


Start position 


Sequence 


9/6 of max. score 


Score 


1 


229 


SEQ ID NO: 9579 


0.49 % 


60 


2 


361 


SEQ ID NO: 9580 


0.27 % 


33.75 


3 


330 


SEQ ID NO: 9581 


0.16 % 


20 
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4 


218 


1 SEO ID NO: 9582 


1 0.09 % 


12 1 


5 


338 


1 SEO ID NO: 9583 


1 0.04 % 


6 


6 


352 


1 SEQ ID NO: 9584 


1 " 0.04 % 


6 1 


7 


103 


1 SEQ ID NO: 9585 


1 0.04 % 


5.4 


8 


291 


1 SEQ ID NO: 9586 


1 0.01 % 


2 


9 


241 


1 SEQ ID NO: 9587 


1 0.01 % 


1.8 


10 


290 


! SEQ ID NO: 9588 


1 0.01 % 


1.8 


11 


316 


; SEQ ID NO: 9589 


1 0.01 % 


1.8 


12 


222 


SEQ ID NO: 9590 


f 0.01 % 


1.35 


13 


266 


SEQ ID NO: 9591 


! 0.01 % 


1.35 


14 


53 


SEQ ID NO: 9592 


0.00 % 


1 


15 


100 


SEQ ID NO: 9593 


0.00 % 


0.9 


16 


138 


SEQ ID NO: 9594 


0.00 % 


0.9 


17 


240 


SEQ ID NO: 9595 


0.00 % 


0.9 


18 


119 


SEQ ID NO: 9596 


0.00 % 


0.675 


19 


44 


SEQ ID NO: 9597 


0.00 % 


0.6 


20 


161 


SEQ ID NO: 9598 


0.00 % 


0.6 




H LA A3 - 10 mers 


Maximum possible score using this molecule type 


12150 


Rank 


Start position 


Sequence 


o/o of max. score 


Score 


1 


338 


SEQ ID NO: 9599 


0.49 % 


60 


2 


160 


SEQ ID NO: 9600 


0.32 % 


40 


3 


352 


SEQ ID NO: 9601 


0.24 % 


30 


4 


361 


SEQ ID NO: 9602 


0.18 % 


22.5 


5 


103 


SEQ ID NO: 9603 


0.13 % 


16.2 


6 


290 


SEQ ID NO: 9604 


0.07 % 


9 


7 


351 


SEQ ID NO: 9605 


0.07 % 


9 


8 


44 


SEQ ID NO: 9606 


0.04 % 


6 


9 


228 


SEQ ID NO: 9607 


0.03 % 


4.05 


10 


394 


SEQ ID NO: 9608 


0.02 % 


3 


11 


240 


SEQ ID NO: 9609 


0.02 % 


2.7 


12 


100 


SEQ ID NO: 9610 


0.01 % 


1.8 


13 


114 


SEQ ID NO: 9611 


0.01 % 


1.8 


14 


93 


SEQ ID NO: 9612 


0.01 % 


1.5 


15 


134 


SEQ ID NO: 9613 


0.01 % 


1.5 


16 


221 


SEQ ID NO: 9614 


0.01 % 


1.35 


17 


330 


SEQ ID NO: 9615 


0.00 % 


1.2 


18 


112 


SEQ ID NO: 9616 


0.00 % 


0.9 



-470- 



BNSDCX:iD: <WO___2004092360A2J_> 



wo 2004/092360 



PCT/US2004/011710 
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wo 2004/092360 
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10 


277 


SEQ ID NO: 9648 


0.45 % 


7.2 


11 


150 


SEQ ID NO: 9649 


0.37 % 


6 


12 


152 


SEQ ID NO: 9650 


0.37 % 


6 


13 


158 


SEQ ID NO: 9651 


0.37 % 


6 


14 


171 


SEQ ID NO: 9652 


0.37 % 


6 


15 


343 


SEQ ID NO: 9653 


0.37 % 


6 


16 


110 


SEQ ID NO: 9654 


0.34 % 


5.5 


17 


85 


SEQ ID NO: 9655 


0.31 % 


5 


18 


47 I 


SEQ ID NO: 9656 1 


0.30 % 


4.8 


19 


213 i 


SEQ ID NO: 9657 1 


0.30 % 


4.8 


20 


218 1 


SEQ ID NO: 9658 | 


0.30 % 


4.8 



HLA A 0201 - 9 mers 


Maximum possible score using this molecule type 


3925227.1 


Rank 


Start position 


Sequence 


°/o of max. score 


Score 


1 


222 


SEQ ID NO: 9659 


0.03 % 


1267.10434728 


2 


226 


SEQ ID NO: 9660 


0.00 % 


69.552 


3 


316 


SEQ ID NO: 9661 


0.00 % 


50.232 


4 


351 


SEQ ID NO: 9662 


0.00 % 


31.24872 


5 


159 


SEQ ID NO: 9663 


0.00 % 


13.6235739 


6 


406 


SEQ ID NO: 9664 


0.00 % 


11.4264 


7 


165 


SEQ ID NO: 9665 


0.00 % 


8.14407 


8 


238 


SEQ ID NO: 9666 


0.00 % 


7.0518 


9 


138 


SEQ ID NO: 9667 


0.00 % 


5.112072 


10 


130 


SEQ ID NO: 9668 


0.00 % 


3.00547233 


11 


303 


SEQ ID NO: 9669 


0.00 % 


2.59578 


12 


157 


SEQ ID NO: 9670 


0.00 % 


2.412585 


13 


219 


SEQ ID NO: 9671 


0.00 % 


2.103255861 


14 


305 


SEQ ID NO: 9672 


0.00 % 


1.86369 


15 


158 


SEQ ID NO: 9673 


0.00 % 


1.646892 


16 


331 


SEQ ID NO: 9674 


0.00 % 


1.614048 


17 


399 


SEQ ID NO: 9675 


0.00 % 


1.442246832 


18 


324 


SEQ ID NO: 9676 


0.00 % 


1.319625 


19 


312 


SEQ ID NO: 9677 


0.00 % 


1.233099 


20 


262 


SEQ ID NO: 9678 


0.00 % 


0.966 



HLA A 0201 - 10 mers 

Maximum possible score usi ng this molecule type H 3925227/1 

iRanklfstart position H Sequence || % of max. score II Score 
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1 


221 


SEQ ID NO: 9679 || 


0.00 % 


309.0498408 


2 


112 


SEQ ID NO: 9680 || 


0.00 % 


98.26704 


3 


330 


SEQ ID NO: 9681 || 


0.00 % 


98.26704 


4 


158 


SEQ ID NO: 9682 !| 


0.00 % 


36.31608 


5 


218 


SEQ ID NO: 9683 || 


0.00 % 


24.0754248 


6 


124 


SEQ ID NO: 9684 || 


0.00 % 


12.2199 


7 


55 


SEQ ID NO: 9685 II 


0.00 % 


10.467576 


8 


315 


SEQ ID NO: 9686 II 


0.00 % 


7.7274204 


9 


350 


SEQ ID NO: 9687 II 


0.00 % 


4.296699 


10 


405 


SEQ ID NO: 9688 H 


0.00 % 


4.286487 


11 


388 


SEQ ID NO: 9689 II 


0.00 % 


4.054785 


12 


322 


SEQ ID NO: 9690" || 


0.00 % 


3.883803 


13 


130 


SEQ ID NO: 9691 || 


0.00 % 


3.428691903 


14 


45 


SEQ ID NO: 9692 jj 


0.00 % 


3.411230625 


15 


132 


SEQ ID NO: 9693 || 


0.00 % 


2.99943 


16 


410 


SEQ ID NO: 9694 || 


0.00 % ' 


2.63718 


17 


- 316 


SEQ ID NO: 9695 H 


0.00 % 


2.48686074 


18 


104 


SEQ ID NO: 9696 II 


0.00 % 


2.477311485 


19 


164 


SEQ ID NO: 9697 || 


0.00 % 


2.2011 V 


20 


282 


SEQ ID NO: 9698 || 


0.00 % 


2.16591 



HLA A 1101 - 9 mers 


Maximu 


m possible score using thils molecule type . 


1 36 


Rank 


Start position 


Sequence 


9/6 of max. score 


1 Score 




361 


SEQ ID NO: 9699 


16.66 % 


1 6 


2 


53 


SEQ ID NO: 9700 


2.77 % 


i 1 


3 


240 


SEQ ID NO: 9701 


1.66 % 


1 0.6 


4 


241 


SEQ ID NO: 9702 


1.66 % 


1 0.6 







HLA A 1101 - 10 mers 




Maximu 


m possible score using this molecule type 


1 36 


Rank 


Start position 


Sequence 


% of max. score 


1 Score 


1 


361 


SEQ ID NO: 9703 


16.66 % 


1 6 


2 


93 


SEQ ID NO: 9704 


8.33 % 


[ 3 


3 


338 


SEQ ID NO: 9705 


3.33 % 


1.2 


4 


134 


SEQ ID NO: 9706 


2.77 % 


1 


5 


228 


SEQ ID NO: 9707 


2.5 % 


0.9 


6 


160 


SEQ ID NO: 9708 


2.22 % 


0.8 


7 


239 


SEQ ID NO: 9709 


1.66 % 


0.6 
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8 


240 


SEO ID NO: 9710 


1.66 % 


0.6 


9 


257 


SEO ID NO: 9711 


1.66 % 


0.6 


10 


379 


SEQ ID NO: 9712 


1.66 % 


0.6 




HLA B7 - 9 mers 


Maximum possible score using this molecule type 


5400 


Rank I 


Start position 


Sequence 


9A> of max. score 


Score 


1 ! 


105 


SEQ ID NO: 9713 


14.81 % 


-800 


2 i 


66 


SEQ ID NO: 9714 


1.48 % 


80 


3 1 


93 


SEQ ID NO: 9715 


0.92 % 


50 




257 


SEQ ID NO: 9716 


0.55 % 


30 


5 1 


323 


SEQ ID NO: 9717 


0.37 % 


20 


6 ! 


211 


SEQ ID NO: 9718 


0.22 % 


12 


7 1 


219 


SEQ ID NO: 9719 


0.22 % . 


12 


8 1 


403 


SEQ ID NO: 9720 


0.18 % 


10 


9 ! 


343 


SEQ ID NO: 9721 


0.14 % 


8 


10 1 


12 


SEQ ID NO: 9722 


0.11 % 


6 


11 \ 


113 


SEQ ID NO: 9723 


0.11 % 


6 




48 


SEQ ID NO: 9724 


0.07 % 


4 


13 1 


56 


SEQ ID NO: 9725 


0.07 % 


4 


14 


150 


SEQ ID NO: 9726 


0.07 % 


4 




153 


SEQ ID NO: 9727 


0.07 % 


4 




159 


SEQ ID NO: 9728 


0.07 % 


4 


17 


213 


SEQ ID NO: 9729 


0.07% 


4 


18 


216 


SEQ ID NO: 9730 


0.07 % 


4 


19 


i 222 


SEQ ID NO: 9731 


0.07 % 


4 


20 


i 283 


SEQ ID NO: 9732 


0.07 % ; 


4 




HLA B7 - 10 mers 


Maximum possible score using this molecule type 


1 5400 




Start position 


Sequence 


°/o of max. score 


1 Score 


1 


36 


SEQ ID NO: 9733 


1.48 % 


1 80 


2 


150 


SEQ ID NO: 9734 


1.48 % 


1 80 


3 


343 


SEQ ID NO: 9735 


1.48 % 


1 80 


4 


12 


SEQ ID NO: 9736 


1.11 % 


1 60 


5 


308 


SEQ ID NO: 9737 


1.11 % 


1 60 


6 


130 


SEQ ID NO: 9738 


0.37 % 


1 20 


7 


55 


SEQ ID NO: 9739 


0.22 % 




8 


210 


SEQ ID NO: 9740 


0.22 % 
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9 


218 


SEQ ID NO: 9741 


0.22 % 


12 


10 


201 


SEQ ID NO: 9742 


0.18 % 


10 


11 


121 


SEQ ID NO: 9743 


0.14 % 


8 


12 


391 


SEQ ID NO: 9744 


0.13 % 


7.5 


13 


112 


SEQ ID NO: 9745 


0.11 % 


6 


14 


385 


SEQ ID NO: 9746 


0.11 % 


6 


15 


47 


SEQ ID NO: 9747 


0.07 % 


4 


16 


66 


SEQ ID NO: 9748 


0.07 % 


4 


17 


95 


SEQ ID NO: 9749 


0.07 % 


4 


18 


104 


SEQ ID NO: 9750 


0.07 % 


4 


19 


152 


SEQ ID NO: 9751 


0.07 % 


4 


20 


158 


SEQ ID NO: 9752 


0.07 % 


4 
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TABLE 26: Cloned sequences for E,coU expression 



ORF 


DNA length 


Cloning 




bp 


pET 


pGEX 


P28 


537 


Ndel/Xhol 




P65 


1917 


Nhel/ffindin 




NsplA 


2495 


Nhel/Xhol 




NsplB 


2153 


Ndel/Xhol 




NsplC 


2612 


Ndel/Xhol 




Nsp2A 


431 


Ndel/Xhol 


BamHI / Xhol 


Nsp2B 


426 


Ndel/Xhol 


Bamm / Xhol 


Nsp3 


870 


Ndel/Xhol 




Nsp4 


249 


Ndel/Xhol 


Bamm/Xhol 


Nsp5 


594 


Nhel/Xhol 




Nsp6 


339 


Ndel/Xhol 


Bamffl / Xhol 


Nsp7 


417 


Ndel/Xhol 


Bamm /Xhol 


Nsp9A 


1385 


Nhel/Xhol 




Nsp9B 


1409 


Ndel/Xhol 




NsplO 


1803 


Nhel/Xhol 




Nspll 


1581 


Ndel/Xhol 




Nspl2 


1038 


Ndel / Hindin 




Nspl3 


897 


Ndel/Xhol 




Spike (SI) 


1946 


Ndel/Xhol 




Spike (S2) 


1598 


Ndel/Xhol 




Spike (S1-S2) 


3545 


Ndel/Xhol 




HRl 


287 


Ndel/Xhol 


Bamm /Xhol 


HR2 


146 


Ndel/Xhol 


Bamm /Xhol 


ORF3A100 


525 


Ndel/Xhol 




ORF4 


465 


Ndel/Xhol 




Envelope (E) . 


231 


Ndel/Xhol 


Bamm /Xhol 


Matrix (M)A100 


366 


Ndel/Xhol 


Bamm / Xhol 


ORF7A18 


137 


Ndel/Xhol 


Bamm / Xhol 


ORF8 


369 


Ndel/Xhol 


Bamm /Xhol 


ORF9 


135 


Ndel/Xhol 


Bamm /Xhol 


ORFIO 


120 


Nhel/Xhol 


Bamffl / Xhol 


ORFll 


255 


Ndel/Xhol 


Bamm / Xhol 


Nucleocapsid (N) 


1269 


Ndel/EcoRI 




ORF12 


297 


Ndel/EcoM 


Bamffl /EcoRI 
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TABLE 27: Primers 
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ORF 


Forward primer 


Reverse primer 


P28 


9803 


—tI 


P65 


9804 


9819 


M^^IR 


9805 


9820 


^^^1^ 


9806 


9821 


O ^ 


9807 


9822 


Nsp2 + Nsp3 


9808 


9823 


Nsp4 to Nsp7 


9809 


9824 


Nsp9A 


9810 


9825 


Nsp9B 


9811 


9826 


NsplO 


9812 


9827 


Nspll 


9813 


9828 


Nspl2-Nspl3 


9814 


9829 


ORF3-ORF4 


9815 


9830 


Env-ORFIO 


9816 


9831 


ORF11-ORF12 


9817 


9832 



TABLE 28: Primers 



ORF 


Forward primer 


Reverse primer 


Nsp2A 


SEQIDNO: 9833 


SEQ ID NO: 9858 


Nsp2B 


SEQ ID NO: 9834 


SEQ ID NO: 9859 


Nsp3 


SEQ ID NO: 9835 


SEQ ID NO: 9860 


Nsp4 


SEQ ID NO: 9836 


SEQ ID NO: 9861 


Nsp5 


SEQ ID NO: 9837 


SEQ ID NO: 9862 


Nsp6 


SEQ ID NO: 9838 


SEQ ID NO: 9863 


Nsp7 


SEQ ID NO: 9839 


SEQ ED NO: 9864 


Nspl2 


SEQ ID NO: 9840 


SEQIDNO: 9865 


Nspl3 


SEQIDNO: 9841 


SEQIDNO: 9866 


Spike SI 


SEQ ID NO: 9842 


SEQIDNO: 9867 


Spike S2 


SEQIDNO: 9843 


SEQ ID NO: 9868 


Spike S1-S2 


SEQ ID NO: 9844 


SEQ ID NO: 9869 


HRl 


SEQ ID NO: 9845 


SEQIDNO: 9870 


HR2 


SEQ JD NO: 9846 


SEQ ID NO: 9871 


OrfSAlOO 


SEQ ID NO: 9847 


SEQ ID NO: 9872 


Orf4 


SEQ ID NO: 9848 


SEQ ID NO: 9873 


EnvE 


SEQ ID NO: 9849 


SEQ ID NO: 9874 


Matrix MA 100 


SEQ ID NO: 9850 


SEQ ID NO: 9875 


Orf7A18 


SEQIDNO: 9851 


SEQ ID NO: 9876 


OrfS 


SEQ ID NO: 9852 


SEQ ID NO: 9877 


Orf9 


SEQ ID NO: 9853 


SEQ ID NO: 9878 


OrflO 


SEQ ID NO: 9854 


SEQ ID NO: 9879 


Orfll 


SEQ ID NO: 9855 


SEQ ID NO: 9880 


Nucleocapsid N 


SEQ ID NO: 9856 


SEQIDNO: 9881 


Qrfl2 


SEQ ID NO: 9857 


SEQ ID NO: 9882 
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BNSDCKID: <WO__2004092360A2_I_> 



TABLE 29: Cloning, purification and expression in E.coli 



SARS CoV ORFs 


M.W 
Kd 


cloning 


Expr. 


purification as 


P28 


19.7 


+ 


+ 


his sol 


P65 


70,3 


+ 


+ 


his sol 


NsDlA(N-term) 


91,6 _ 


+ 


+ 


his ins 


Nsd1B fcore> 


80.8 


+ 


, - 




NsDlC(C-term) 


95.3 


+ 


- 




Nsp2AfN-term) 


15,8 


+ 


+ 


his ins 


Nsp2B (C-lerm) 


15,5 . 


+ 


+ 


his sol 


Nsp3 


31,9 


+ 


: 




Nsd4 


9,1 


+ 


+ 


; . 

his sol 


Nsp5 


21,8 






his sol 


NSPo 








ms SOI 


Nsp7 


15,3 


+ 


+ 


his ins 


Nsp9A(N-term) 


50,8 


+ 


_ 




Nsp9B (C-term) 


51.6 


+ 


+ 


his ins 


Nsp10 


66 








NSD11 


58 


- 






Nsp12 


38 


_ 






Nsp13 


32.7 




+ 


his ins 


Qnilro /Q1-hie\ 
opiKc ■"lilSf 


71 .3 




+._ 


bis jn^ _ 


Spike (S2-his) 


58,6 


+ 







Spike (S1S2-nis) 


130 


+ 


+ 


his ins 


HRl 


■4 4 
1 1 


+ 


+ 


nis ms 


HR2 


5.4 












+ 






vJnr'** 


'tis o 


+ 


+ 


nis ins ^rimcry 


Envelope CE) 


34,0 


+ 


+ 


net ine /m\ 

qsi ins UDL _ 


Matrix iiviiAioo 


13,3 


+ 


+ 


his ins 


ORF7A18^ 


31 


+ 


+ 


ast sol 


ORFS 


39 5 






gst ins (IB) 


ORF9 


30,8 


+ 


+ 


qst sol 


ORF10 


30,3 


+ 


+ 


gst ins (IB) 


ORF11 


35,2 


+ 


+ 


gst ins (IB) 


NucleocapsidtNl 


43.6 


+ 


±^ 


his ins 


ORF12 


36,7 


+ 


+ 


his ins 
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TABLE 30: E.coli expression, purity and yield 



Protein 


Tag 


Purity (%) 


Yield (mg/l) 


Nsp2A (N-tenn) 


His 


95 


1.7 


Nsp2B (C-terai) 


His 


95 


4.1 


Nsp4 


His 


95 


12.6 


Nsp5 


His 


95 


5.88 


Nsp6 


His 


95 


8.1 


P28 


His 


95 


1 


P65 


His 


80 


0.553 


HR2 


His 


95 


11.9 


HRl 


His 


80 


2.64 


NsplA 


His 


95 


0.267 


Spike S1-S2 


His 


80 . 


0.381 


Matrix M 


His 


85 


12.4 


ORF7 


GST 


85 


4.9 



TABLE 31: Primers 



SEQ ID NO: 


Rank 


Model 


Local 


OPosition) 


10235 


Fl 


1 


1 


(106) 


10236 


F2 


2 


1 


(728) 


10237 


F3 


3 




(112) 


10238 


F4 


5 




(1331) 


10239 


F5 


6 




(12) 


10240 


F6 


6 




(346) 


10241 


F7 


8 




(904) 


10242 


F8 


9 




(1016) 


10243 


F9 


9 




(1015) 


10244 


FIO 


9 




(719) 


10245 


Fll 


9 




(720) 


10246 


F12 


10 




(724) 


10247 


Rl 


2 




(1283) 


10248 


R2 


4 




(756) 


10249 


R3 


4 




(758) 


10250 


R4 


5 




(259) 


10251 


R5 


6 




(54) 


10252 


R6 


7 




(648) 


10253 


R7 


8 




(948) 


10254 


R8 


8 




(260) 


10255 


R9 


9 




(1282) 


10256 


RIO 


9 




(950) 


10257 


Rll 


9 




(756) ■ 


10258 


R12 


10 




(132) 



5 
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TABLE 32: Primers 



Primers 


List: 


[forward) 










Scores 








Hank 


Model 


Irfseal 


Sequence 


(Posltiion) 


PI 


7. 


1 


SEQ ID NO: 


10352 


(290) 


P2 


7 


1 


SEQ ID NO: 


10353 


(291) 


F3 


7 


1 


SEQ ID NO: 


10354 


(294) 


P4 


7 


1 


SEQ ID NO: 


10355 


(292) 


F5 


7 


1 


SEQ ID NO: 


10356 


(293) 


F6 


9 


1 


SEQ ID NO: 


10357 


(198) 


P7 


9 


1 


SEQ ID NO: 


10358 


(199) 


F8 


10 


1 


SEQ ID NO: 


10359 


(33) 


F9 


11 


1 


SEQ ID NO: 


10360 


(200) 


FIO 


11 


1 


SEQ ID NO: 


10361 


(299) 


Fll 


12 


1 


SEQ ID NO: 


10362 


(298) 


P12 


12 


1 


SEQ ID NO: 


10363 


(297) 






1 


SEQ ID NO: 


10364 


(35) 


F14 


14 


1 . 


SEQ ID NO : 


10365 


(34) 






1 


SEQ ID NO: 


10366 


(300) 


F16 


17 


1 


SEQ ID NO: 


10367 


(295) 


F17 


17 


1 


SEQ ID NO: 


10368 


(296) 


F18 




1 


SEQ ID NO: 


10369 


(175) 


F19 


17 


1 


SEQ ID NO: 


10370 


(36) 


F20 


20 


1 


SEQ ID NO: 


10371 


(202) 


P21 


20 


1 


SEQ ID NO: 


10372 


(201) 


P22 


28 


1 


SEQ ID NO: 


10373 


(204) 


P23 


28 


1 


SEQ ID NO: 


10374 


(203) 


F24 


29 


1 


SEQ ID NO: 


10375 


(269) 


F25 


29 


1 


SEQ ID NO: 


10376 


(268) 




Iiisi: ( 


reverse) 














Serene e 


(Position) 


Rl 


7 


1 


SEQ ID NO: 


10377 


(337) 


R2 


9 


1 


SEQ ID NO : 


10378 


(229) 


R3 






SEQ ID NO: 


10379 


(230) 


R4 


11 


1 


SEQ ID NO: 


10380 


(338) 




1? 


1 


SEQ ID NO : 


10381 


(207) 




12 




SEQ ID NO: 


10382 


(338) 




13 


^ 


SEQ ID NO: 


10383 


(231) 




14 


1 


SEQ ID 'NO: 


10384 


(80) 






1 


SEQ ID NO: 


10385 


(232) 


Rio 


15 




SEQ ID NO: 


10386 


(82) 


■Rl 1 


16 


1 


SEQ ID NO: 


10387 


(340) 


R12 




1 


SEQ ID NO: 


10388 


(83) 


R13 


17 


1 


SEQ ID NO: 


10389 


(206) 


R14 


17 


1 


SEQ ID NO: 


10390 


(82) 


R15 


17 


1 


SEQ ID NO: 


10391 


(337) 


R16 


18 


1 


SEQ ID NO: 


10392 


(341) 


R17 


20 


1 


SEQ ID NO: 


10393 


(340) 


R18 


20 


1 


SEQ ID NO: 


10394 


(233) 


R19 


21 


1 


SEQ ID NO: 


10395 


(79) 


R20 


22 


1 


SEQ ID NO: 


10396 


(213) 


R21 


28 


1 


SEQ ID NO: 


10397 


(236) 


R22 


29 


1 


SEQ ID NO: 


10398 


(317) 


R23 


32 


1 


SEQ ID NO: 


10399 


(391) 


R24 


35 


1 


SEQ ID NO: 


10400 


(57) 


R25 


36 


1 


SEQ ID NO: 


10401 


(237) 



Primers List (left part): SEQ ID NQ^: 10402-10433 



Primers list (right part): SEQ ID NO^: 10434-10464 



Primers List (forwaid): SEQ ID NO^: 10465-10484 | Primers List (reverse): SEQ ID NO^: 10485-10504 



BNSOCXSID: <WO__20CM092360A2J_> 
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TABLE 33: Primers 



PCT/US2004/011710 



Primers List (forward) 

Scores 



Rank 


Hodel 


Local 




Seguexioe 


(Posifciom) 


Fl 


1 


1 


SEQ 


ID 


NO: 


10580 


(637) 


F2 


2 


1 


SEQ 


ZD 


NO: 


10581 


(439) 


F3 


2 


1 


SEQ 


ID 


NO: 


10582 


(440) 


F4 


3 


1 


SEQ 


ID 


NO: 


10583 


(729) 


P5 


4 


1 


SEQ 


ID 


NO: 


10584 


(696) 


F6 


4 


1 


SEQ 


ID 


NO: 


10585 


(697) 


F7 


4 


1 


SEQ 


ID 


NO: 


10586 


(111) 


F8 


5 


1 


SEQ 


ID 


NO: 


10587 


(867) 


F9 


5 


1 


SEQ 


ID 


NO: 


10588 


(868) 


FIO 


5 


1 


SEQ 


ID 


NO: 


10589 


(869) 


Fll 


5 


1 


SEQ 


ID 


NO: 


10590 


(640) 


F12 


6 


1 


SEQ 


ID 


NO: 


10591 


(438) 


F13 


6 


1 


SEQ 


ID 


NO: 


10592 


(437) 


F14 


6 


1 


SEQ 


ID 


NO: 


10593 


(436) 


F15 


6 


1 


SEQ 


ID 


NO: 


10594 


(732) 


F16 


6 


1 


SEQ 


ID 


NO: 


10595 


(635) 


F17 


6 


1 


SEQ 


ID 


NO: 


10596 


(457) 


F18 


6 


1 


SEQ 


ID 


NO:' 


10597 


(458) 


F19 


6 


1 


SEQ 


ID 


NO:' 


10598 


(636) 


F20 


7 


1 


SEQ 


ID 


NO: 


10599 


(854) 


F21 


7 


1 


SEQ 


ID 


NO: 


10600 


(855) 


F22 


7 


1 


SEQ 


ID 


NO: 


10601 


(581) 


F23 


7 


1 


SEQ 


ID 


NO: 


10602 


(853) 


F24 


7 


1 


SEQ 


ID 


NO: 


10603 


(342) 


P25 


7 


1 


SEQ 


ID NO: 


10604 


(343) 


F26 


7 


1 


SEQ 


ID 


NO: 


10605 


(112) 


P27 


7 


1 


SEQ 


ID 


NO: 


10606 


(94) 


F28 


7 


1 


SEQ 


ID 


NO: 


10607 


(642) 


F29 


8 


1 


SEQ 


ID 


NO: 


10608 


(638) 


F30 


8 


1 


SEQ 


ID 


NO: 


10609 


(639) 


FBI 


8 


1 


SEQ 


ID 


NO: 


10610 


(730) 


F32 


8 


1 


SEQ 


ID 


NO- 


10611 


(641) 


F33 


8 


1 


SEQ 


ID 


NO: 


10612 


(731) 


F34 


8 


1 


SEQ 


ID 


NO: 


10613 


(326) 


F35 


8 


1 


SEQ 


ID 


NO: 


10614 


(325) 


F36 


9 


1 


SEQ 


ID 


NO: 


10615 


(517) 


F37 


9 


1 


SEQ 


ID 


NO: 


10616 


(701) 


F38 


9 


1 


SEQ 


ID 


NO: 


10617 


(208) 


F39 


9 


1 


SEQ 


ID 


NO: 


10618 


(209) 


F40 








ID 


NO: 


10619 


(702) 


F41 


9 


1 


SEQ 


ID NO: 


10620 


(210) 


F42 


10 


1 


SEQ 


ID 


NO: 


10621 


(634) 


F43 


10 


1 


SEQ 


ID 


NO: 


10622 


(694) 


F44 


10 


1 


SEQ 


ID 


NO: 


10623 


(693) 


F45 


10 


1 


SEQ 


ID 


NO: 


10624 


(728) 


F46 


10 


1 


SEQ 


ID 


NO: 


10625 


(695) 


F47 


10 


1 


SEQ 


ID 


NO: 


10626 


(95) 


F48 


11 


1 


SEQ 


ID 


NO: 


10627 


(455) 


F49 


11 


1 


SEQ 


ID 


NO: 


10628 


(456) 


F50 


11 


1 


SEQ 


ID NO: 


10629 


(454) 



-481- 



BNSDOCID: <WO__a0040923eOAa_l_> 



wo 2004/092360 



PCT/US2004/011710 



Primers List (reverse) 



Scores 



Rank 


Model 


Irf>cal 






(Posi'bxon) 


Rl 


1 


1 


SEQ 


ID NO: 


10630 


(367) 


R2 


1 


1 


SEQ 


ID NO; 


10631 


(666) 


R3 


2 


1 


SEQ 


ID NO: 


10632 


(464) 


R4 


3 


1 


SEQ 


ID NO: 


10633 


(669) 


R5 


3 


1 


SEQ 


ID NO: 


10634 


(750) 


R6 


4 


1 


SEQ 


ID NO: 


10635 


(720) 


R7 


4 


1 


SEQ 


ID NO: 


10636 


(465) 


R8 


4 


1 


SEQ 


XD NO: 


10637 


(370) 


R9 


4 


1 


SEQ 


ID NO: 


10638 


(668) 


RIO 


4 


1 


SEQ 


ID NO: 


10639 


(135) 


Rll 


5 


1 


SEQ 


ID NO: 


10640 


(901) 


R12 


5 


1 


SEQ 


ID NO: 


10641 


(667) 


R13 


6 


1 


SEQ 


ID NO: 


10642 


(609) 


R14 


6 


1 


SEQ 


ID NO: 


10643 


(464) 


R15 


6 


1 


SEQ 


ID NO: 


10644 


(665) 


R16 


6 


1 


SEQ 


ID NO: 


10645 


(486) 


R17 


6 


1 


SEQ 


ID NO: 


10646 


(356) 


R18 


6 


1 


SEQ 


XD NO: 


10647 


(758) 


R19 


7 


1 


SEQ 


ID NO: 


10648 


(366) 


R20 


7 


1 


SEQ 


XD NO: 


10649 


(368) 


R21 


7 


1 


SEQ 


ID NO: 


10650 


'(136) 


R22 


7 


1 


SEQ 


ID NO: 


10651 


(675) 


R23 


7 


1 


SEQ 


ID NO: 


10652 


(366) 


R24 


7 


1 


SEQ 


ID NO: 


10653 


(608) 


R25 . 


7 


1 


SEQ 


ID NO : 


10654 


(884) 


R26 


7 


1 


SEQ 


ID NO: 


10655 


(120) 


R27 


8 


1 


SEQ 


ID NO: 


10656 


(355) 


R28 


8 


1 


SEQ 


ID NO: 


10657 


(671) 


R29 


8 


1 


SEQ 


ID NO: 


10658 


(756) 


R30 


8 


1 


SEQ 


ID NO: 


10659 


(751) 


R31 


8 


1 


SEQ 


ID NO: 


10660 


(666) 


R32 


9 


1 


SEQ 


ID NO: 


10661 


(242) 


R33 


.9 


1 


SEQ 


ID NO: 


10662 


(543) 


R34 . 


9 


1 


SEQ 


ID NO: 


10663 


(724) 


R35 


9 


1 


SEQ 


ID NO': 


10664 


(482) 


R36 


10 


1 


SEQ 


ID NO : 


10665 


(121) 


R37 


10 


1 


SEQ 


ID NO': 


10666 


(662) 


R38 


10 


1 


SEQ 


ID NO: 


10667 


(750) 


R39 


10 


1 


SEQ 


ID NO: 


10668 


(719) 


R40 


10 


1 


SEQ 


ID NO: 


10669 


(242) 


R41 


11 


1 


SEQ 


ID NO: 


10670 


(484) 


R42 


11 


1 


SEQ 


ID NO: 


10671 


(375) 


R43 


11 


1 


SEQ 


ID NO: 


10672 


(728) 


R44 


11 


1 


SEQ 


ID NO: 


10673 


(373) 


R45 


11 


1 


SEQ 


ID NO: 


10674 


(998) 


R46 


11 


1 


SEQ 


ID NO: 


10675 


(486) 


R47 


12 


1 


SEQ 


ID NO: 


10676 


(881) 


R48 


12 


1 


SEQ 


ID NO: 


10677 


(882) 


R49 


12 


1 


SEQ 


ID NO: 


10678 


(244) 


RSO 


12 


1 


SEQ 


ID NO: 


10679 


(1003) 



Primers List (left part): SEQ ID NO^: 10680-10974 


Primers List (right part): SEQ ID NO^: 10975-11282 


Primers List (forward): SEQ ID NO^: 11283-11302 


Primers List (reverse): SEQ ID NO^: 11303-11322 
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TABLE 34 



Compounc 

# 


1 

StructurG 


Name 


MH+ 


1 




N-nriethyl-4-[(2-{[2-(1 - 
Tiethylethyl)phenyl]amino}-1 H- 
benzimjclazol-5-yl)oxy]pyricline- 
2-carboxamicle 


402.5 


2 


H.C-S|-CH, 


N-methyl-4-{[1 -methyl-2-({3- 
[(trimethylsilyl)ethynyl]phenyl}am 
ino)-1 H-benzimidazol-S- 
ylloxv)pvricline-2-carboxam ide 


470.6 


3 






N-methyl-4-[(1 -methyl-2-{[2- 
(phenylcarbonyl)phenyl]am ino}- 
1H-benzimidazo!-5- 
yl)oxvlpvridine-2-carboxamide 


478.5 


4 


H3C 


0" 


4-(methyloxy)-N-[6-(methyloxy)- 
1 ,3-benzothiazol-2-yl]-3- 
litrobenzamrde 


360.4 


5 




4-({2-[(4-butylphenyl)amino]-1,3- 

benzothiazol-5-yl}oxy)-N- 

methylpvridine-2M;arboxamide 


433.5 




0 




M-methyl-4-({1 -methyl-2-I(6- 
pyrrolldln-1-yIpyridin-3-yl)amino]- 
1 H-benzimidazol-5- 
ld}rocy)pyridine-2-carboxamide 


444.5 


7 


d 






l-({2-[1 ,1 ■-bi(cyclohexyl)-2- 
irlamino]-1-methyl-1H- 
3enzimidazol-5-yl}oxy)-N- 
Tiethylpyridine-2-carboxam ide 


462.6 


8 


H.C ^ 

_i 


^-({2-[(4-chlorophenyl)amino]-1- 
nethyl-1 H-benzimidazoI-5- 
rl}oxy)-N-1 ,3-thiazoI-2- 
?!2\?;|d!ne-2-carboxamide 


477.9 
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9 




4-[(1-m©thyl-2-{[2- 
(methyloxy)phenyqamino}->1 H- 
benzimidazol-5-yi)oxy]-N-[3- 
(methyioxy)propyr|pyrid[ne-2> 
carboxamide 


462.5 


10 




4-({2-[(4-ethyIphenyl)amino]-1,3- 

benzoxazol-5-y}}oxy)-N- 

methylpyridlne-2-carboxamlde 


389.4 


11 




1 -[(3-tluorophGnyl)carbonyl]-4- 
([4- 

(trlfluoromethyl)phenyl]methyl}pi 
perazine 


367.4 


12 


^3 9 9H3 


1 -[2-(ethyloxy)phenylH-{[3.4.5- 
tris(methyloxy)phenyi]carbonyl}p 
iperazine 


401.5 


13 


CI 


1 -(3-chlorophenyl)-4-{[2- 

(ethyloxy)phenyl]carbonyl}pipera 

zine 


345.8 


14 




3-({4-[(2E)-3-phenylprop-2- 
enyl]piperazin-1 -yl}carbonyl)-7- 
oxabicyclo[2.2.1 ]heptane-2- 
carboxylic acid 


371.4 


15 




1 -[2-(nTethyloxy)phenyl]-4- 
{[3.4,5- 

[ris(methyloxy)phenyl]carbonyl}p 
iperazine 


387.4 


16 




3-[(4-pyridin-2-ylplperazin-1 - 
yl)carbonyl]-7- 
oxabicyclo[2.2.1]heptane-2- 
carboxylic acid 


332.4 
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17 




3-pentyI-7-[(4-phenylpiperazrn-1 
yl)carbonyl]-2-thloxo-2,3- 
dlhydroquinazolin-4n H)-one 


437.6 


18 




1-[(E)-({4-[(2.4- 

dimethylphenyl)methyl]piperazin 
-1 -yl}imino)methyl]naphthalen-2- 
oi 


374.5 


19 


o 


5-chloro-1-{[3- 

(trifluoromethyl)phenyl]methyl}- 
1H-indoIe-2,3-dione 


340.7 


20 


6 o 


1 -[(4-methylphenyl)methyl]-5- 
iitro-1 H-indole-2.3-dione 


297.3 


21 


CH3 CH3 0 


1 -methyl-6,7-bIs(methyloxy)-2- 
[[3-(methyloxy)phenyl]carboni^}- 
1 .2.3.4-tetrahvdroisoauinoline 


342.4 


22 


^^^^^^^^ 


1 -methyl-6,7-bis(methyloxy)-2- 
naphthalen-2-yJcarbonyl)- 
1 .2.3.4-tetrahvdroisoauinoljne 


362.4 


23 




2- (trlfluoromethyl)phenyl]methyl 

3- I4-(aminocarbonyl)phenyl]-2- 
3ycIoheptyI-1 -0x0-1 ,2,3,4- 
etrahydroisoquinoljne-4- 
:arboxvlate 


565.6 


24 




inthra[1 ,2-c][1 ,2,5]thiadiazole- 
3,11-dione 


267.3 



-485- 



wo 2004/092360 



PCT/US2004/011710 



25 




benzorb1oxanthrene-6,1 1-dione 


265.2 


26 


o 

o o 


ethyl 6,11 -dioxo-6,11 - 
clihydrobenzo[b]phena2ine-2- 
carboxytate 


333.3 


27 


K,C^ CH 

o=s=o O 

uoo 


N,N-dimethyl-9,1 0-dioxo-9,1 0- 
dihydroanthracene-1 - 
sulfonamide 


316.3 


28 




2-(trifluoromethyl)-3-{[3,4,5- 

tris(methyIoxy)phenyl]carbonyl}n 

aphtho[2,3-b]furan-4,9-dion© 


461 .4 


29 


o 


2-(2-oxopropyl)-2-phenyl-1 H- 
indene-1 ,3(2H)-dlone 


279.3 


30 




ethyl 4-{5-[(3- 

nitrophenyl)carbonyl]-1,3-dioxo- 
1 ,3-dihydro-2H-isoindol-2- 
yl}benzoate 


445.4 


31 


O CI 


5,6-dichloro-2-{2-chloro-5- 
(trifluoromethyl)phenyl]-1 H- 
lsolndole-1 .3(2H)-dlone 


395.6 


32 


H3C. ^^SV. 


3-bromo-4-{[(2- 

fIuorophenyl)methyl]oxy}-5- 

(methyloxy)benzaldehyde 

thiosemicarbazone 


413.3 
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33 




2-[4-(3-chlorophenyl)piperazin-1 

yl]-5-nitrobenzalclehyde 

thiosemicarbazone 


419.9 


34 




4.{[2-(3- 

chlorophenyl)ethyllamlno)-3- 

nitrobenzaldehyde 

thiosemlcarbazone 


378.9 


35 


H3C 

CH3 N.^|^.--|| ^NHg 

^ S 


(1 E)-6,9-dimethyl-2,3.4,9- 
tetrahydro-1 H-carbazol-1 -one 
thiosemicarbazone 


287.4 


36 


H_N 


(2E)-1 ,1 '-bi(cyclohexan)-1 -en-2- 
one thiosemicarbazone 


252.4 


37 




4-{[2-(4- 

chloroplienyl)ethyl]amlno}-3- 

nltrobenzaldehyde 

thiosemicarbazone 


378.9 


38 




4-(diethylamino)-2-{[(4- 
fluorophenyl)methyl]oxy}benzatd 
ehyde N-(2-piperidin-1 - 
yiethyDthiosemicarbazone 


486.7 


39 




3,4-bis(methyioxy)benzaldehyde 
1 ,1 -dioxido-1 ,2-benzlsothiazol- 
3-yl)(methvl)hvdrazone 


360.4 


40 




(2E)-2-[(4-chiorophenyl)(5- 
3hlorothien-2- 

/l)methylidene]hydrazInecarboxi 
midamide 


314.2 



-487- 



wo 2004/092360 



PCTAJS2004/011710 



41 


/ 


2-(4-amino-2-oxo-1 -propyl-1 ,2- 
dihydroquinolin-3-yl)-l H- 
benzimldazole-6-carbonitrtle 


344.4 


42 




4-amino-6-fluoro-7-{{[4- 
(methyloxy)phenyl]methyl}amino 
)-3-[5-(4-methylpiperazin-1 -yl)- 
1 H-benzimidazol-2-yl]quinolin- 
2(1H)-one 


528.6 


43 




B-chloro-3-(5-chloro-1 H- 
benzimidazol-2-yl)-4-{[2- 
(dimethylamino)ethyl]amino}quin 
olin-2(1H)-onG 


417.3 


44 


H3C N--^^ 


4-amino-5-(1 H-benzimidazol-2- 
yl)-1-methyl-1 ,7-dihydro-6H- 
t)yrazolor3.4-blpyridin-6-one 


281.3 


45 


6 0 


5,5-dimethyl-4-methylidene-3- 
(2,4,6-trinitrophenyl)-1 ,3- 
oxazolidin-2-one 


339.2 


46 


0 


5-methyl-2-[4- 

(methyloxy)phenyl]hexahydro- 
1 H-isoindole-1 .3(2H)-dione 


274.3 


47 


s 


5-methyl-2-(4- 

Tiethylphenyl)hexahydro-1 H- 
isoindoIe-1 ,3(2H)-dione 


258.3 


48 


N^NH 


N~2~-(4-chlorophenyl)-6,6- 
dimethyl-l ,6-dlhydro-1 ,3,5- 
triazine-2,4-diamine 


252.7 
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49 




(72)-7-(furan-2-ylmethyliclene)-3 

phenyl-3,4-clihydro-2H- 

[1 .3]thiazoloI3,2-a][1 .3,5]triazin- 

6(7H)-one 


312.4 


50 




(3aR,9R,9aR)-6,7-dihydroxy-9- 

[3,4,5-tris(methyloxy)phenyl]- 

3a,4,9,9a- 

tetrahydronaphtho[2,3-c]furan- 
1(3H)-one 


387.4 


51 


CH, >^ 

,N-0" 

o' 


6-chloro-2-(ethyloxy)-4-methyl-3- 
(4-nitrophenyl)-3a,4,9,9a- 
tetrahydro-3H-pyiTolo[2,3- 
b]quinoxaiine 


387.8 


52 


H 


ethyl 2-{ethyloxy)-4-methyl- 
3a,4,9,9a-tetrahydro-3H- 
Dyrrolo[2,3-b]quinoxallne-3- 
carboxylate 


304.4 


53 


H C 


ethyl 4-({[2,5- 

3is(methyloxy)phenyl]amino}met 
hyl)-3,5-dimethyl-1 H-pyrrole-2- 
carboxylate 


333.4 


54 




1-{3-[(6-amino-5-nitropyridin-2- 
yl)amino]propyl}-4-(2- 
chlorophenyl)-N-[(2S)-2- 
hydroxypropyl]-1 H-pyrrole-3- 
carboxamide 


473.9 


55 




(4-methylphenyl)(5-nitro-2- 
plperidin-1 -vlphenvDmethanone 


325.4 


56 


Ob 


2S,5R)-N~1 ~-(4-methylphenyl)- 
5-phenyl-N~2~-(2-pyridin-2- 
/lethyl)pyrrolid(ne-1 ,2- 
dicarboxamide 


429.5 
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57 




2-[(3S)-3-(acetylamfno)-2- 
oxopyrroHdin-l -yl]-N-[2-(4- 
fluorophenyOethyllacetamlde 


322.4 


58 




N-[2-(2,4-clichlorophenyl)ethyl]- 
4-({(Z)-[(4.4- 

difluorocyclohexyl)imino]I(3S)-3- 
methylpiperazin-1 - 
yl]methyl}amlno)benzamide 


553.5 


59 




4-[4-(methyloxy)phenyl]-5- 
phenylisoxazole 


252.3 


60 


H.C'^CK, 


methyl 4-{[4-(1-methylethyl)-2.3- 
dioxo-7-(trifluoromethyl)-3,4- 
dihydroquinoxalin-1 (2H)- 
yi]methyl}benzoate 


421.4 


61 


o 


(3beta,1 6beta)-3, 1 4,1 6- 
trihydroxvbufa-20.22-dienolide 


403.5 


62 


Q 

o 


2-(aminomethyl)-1 -(2-pyridin-2- 
\/lethvl)aulnazolin-4(l H)-one 
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BRIEF DESCRIPTION OF SEQUENCE LISTING 



SEQ ID NO: 


Description 




Draft genome assembly from The Genome Science Center in British Colombia, 
Canada of sequence from TOR2 isolate. TOR2_draftjenomejissmbh 120403 Release 1 


2 


CDC SARS-CoV strain sequence. Entire nucleotide sequence (Urbani strain) 


3-20 


Group-specific coronavirus gene products 

> Feline infectious peritonitis virus (FIPV) 

3/4 = ORF 3b; 5/6 = ORF 3X; 7/8 = ORF 3A 

> Canine coronavirus 

9/10 = ORF 7b; 1 1/12 = ORF 7a 

> Avian infectious bronchitis virus 

13/14 = ORF 5b; 15/16 = ORF 5a; 17/18 = ORF 3a: 19/20 = ORF 3b 


21-520 


500 primers for left part 


521-1020 


500 primers for right part 


1021-3520 


Forward primers from Table 4 


3521-6020 


Reverse primers from Table 4 


6021-6026 


Figure 9 primers 


6027-6033 


Figure 1 1 primers 


6034-6038 


Five primers from http://contem.neJin.or8/cgi/reprint/NEJMod030781v2.pdf 


6039-6051 


PEPltoPEP13 


6052 


Extended PEP13 


6053-6056 


229E hiiman coronavirus sequences 


6057-6060 


TGV sequences 


6061-6064 


PEDV sequences 


6065-6068 


Bovine coronavirus sequences 


6069-6071 


Murine hepatitis virus sequences 


6072-6075 


AIBV sequences 


6076-6170 


Primer sequences (forward) 


6171-6265 


Primer sequences (reverse) 


6266-6304 


Primer sequences (forward) 


6305-6343 


Primer sequences (reverse) 


6344-6366 


Primer sequences (forward) 


6367-6392 


Primer sequences (reverse) 


6393-6440 


Primer sequences (forward) F1-F48 


6441-6487 


Primer sequences (reverse) R 1 -R47 


6488-6559 


Primer sequences 


6560-6568 


Primer sequences 


6569 


The nsp2 proteinase (3CL-PRO) sequence in SARS coronavirus 


6570-72 


The nsp2 proteinases (3CLp) of avian IB V, MHV, and BCoV 


6573 


Consensus nsp2 proteinases sequence 


6574-6577 


IG sequences from Figure 18 


6578 


Expression construct of nSh in pCMVIII 


6579 


Expression construct of nS in pCMVm 


6580 


Expression construct of nSh ATC in pCMVIII 


6581 


Expression construct of nS ATC in pCMVIII 


6582 


Expression construct of nSlh in pCMVlll 


6583 


Expression construct of nSl in pCMVm 


6584-6585 


Primers for cDNA amplification 


6585-6587 


Primers for RT-PCR 


6588-6809 


Component sequences of Figure 23 (>4 amino acids) 


6810-7179 


Component sequences of Figure 24 (>4 amino acids) 


7180-7187 


N-glycosylation sites within SEQ ID NO: 6039 


7188-7189 


Component sequences of Figure 25 


7190 


Fragment of SEQ ID NO: 7 1 88 


7191 


Polynucleotide encoding SEQ ID NO: 7190 


7192 


Amino acids 879-1005 of SEQ ID NO: 6042 


7193 


Amino acids 879-980 of SEQ ID NO: 6042 
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Amino acids 901-1005 of SEQ ID NO: 6042 
Amino acids 1144-1201 of SEQ ID NO: 6042 



Amino acids 1144-1196 of SEQ ID NO: 6042 



Membrane fusion peptide regions 



NadA-based polypeptides 



N-glycosvIation sites within SEQ ID NO: 6042 



Slippage region 



Orflab polyprotein 



Orflab polyproteins 

Xz sequences for SEQ ID NOS 7233-7244 



Orflab polyproteins 



Zinc binding region 2 site 



r-glycosylation sites in SEP ID NOS: 6040-41.6043.6045-46.6050-51 



Polypeptides and polynucleotides 



Intergenic sequences 



Nucleotides from 5" end of SARSV genome followed hy intergenic seoue^^ 

rsJarlA fo-nctnir-tc ^ ^ — ■ 



NadA constructs 

Fragments of SEQ ID NO: 6042 



v sequence 



NadA leader sequences 



Amino acid sequencess from NadA 



PGR primers 



Primers 



CCACC sequence 



3' UTR forward 



3' UTR reverse 



3' UTR probes 



primers 



5' UTR forward primers 



5' UTR reverse primers 



5' UTR probes 



Conserved octanucleotide 



8041-8280 



8281-8486 



8487-8665 



8666-8820 



8821-9018 



9019-9131 



9132-9308 



9309-9437 



9438-9538 



9539-9752 



9753-9763 



Reverse complement of SEQ ID NO: 7293 



Intergenic sequence 



PolyT 



Stem-loop sequence 



Poly-glycine linkers 



Poly-histidine tag 



Nucleocapsid epitope s 



Antisense primer 



Probes 



Antigenic fragments of SEQ ID NO: 6042 



T-epitope analysis of SEQ ID NO: 6039 



T-epitope analysis of SEQ ID NO: 6040 



T-epitope analysis of SEQ ID NO: 6041 



T-epitope analysis of SEQ ID NO: 6042 



T-epitope analysis of SEQ ID NO: 6043 



T-epitope analysis of SEQ ID NO: 6044 



T-epitope analysis of SEQ ID NO: 6045 



T-epitope analysis of SEQ ID NO: 6046 



T-epitope analysis of SEQ ED NO: 6047 



T-epitope analysis of SEQ ID NO: 6048 



T-epitope analysis of SEQ ID NO: 6049 



T-epitope analysis of SEQ ID NO: 6050 



T-epitope analysis of SEQ ID NO: 6052 



9764-9765 



9766-9779 



Primers for spike protein ampli fication, particularly fragments o f^^ikT' 
N-glycosvlation sites within SEQ ID NO: 6039 



Cleavage pr oducts for ORFlab (Table 10) 
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9780-9782 


Forward primer, reverse primer, probe 


9783-9784 


Lysine-rich region 


9785-9798 


Oligonucleotides \ised for S cerevisiae expression 


9799-9802 


Sequences from Figures 65 «fe 66 


9803-9882 


Primers for E.coli cloning 


9883-9885 


BCV nucleotide sequences for Figures 3A, 3B 3C 


9886-9891 


BCV amino acid sequences for Figures 4A, 4B 4C 4D 4E 4F 


9892 


BCVS'UTR ■ 


9893 


BCV 3' UTR " — 


9894-9896 


MHV nucleotide sequences for Figures 3 A, 3B 3C 


9897-9902 


MHV amino acid sequences for Figures 4A, 4B 4C 4D 4E,4F 


9903-9904 


ALB V nucleotide sequences for Figures 3 A 3B 


9905-9909 


AlB V amino acid sequences for Figures 4A, 4B 4D 4E 4F 


9910 


AIBVS'UTO ' ' ' ■ 


9911 


AIBV 3' UTR 


9912-9913 


HOBMPRO. HOBtlGtiA niirlftntiH** c<>niif>nr>sc -fi^i- 'c:niir<><. in ir* 


9914-9918 


Human C!oV aminn Af^d <ftnii^-n/*^c ^nr T7imi*-«>c ^ A ATi. Af^ A'C ATS 


9919 


HCoV-OC43 5' UTR '■ " 


9920 


HCoV-OC43 3' UTR ~~ ~ 


9921-9923 


pCMVKm2 vectors ~~ ~~~ — ' 


9924-9926 


Codon-optimised r^, M and E sequences 


9927 


BNI-1 r 


9928-9959 


Oonstituent amino acid sequences ^4aa inferred from SEQ ID NO* 9927 


9960 


ORFlab variant ~ 


9961 


ORFla variant 


9962 


Spike variant 


9963 


Membrane variant 


9964 


Nucleocapsid variant 


9965-9966 


Short ORFs " — ~ ' 


9967 


ERA complete genome 
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CLAIMS 

1. An isolated polypeptide of the SARS virus. 

2. The polypeptide of claim 1, wherein the polypeptide is a Spike (S) polypeptide, an Env 
(E) polypeptide, a Membrane (M) polypeptide, a hemagglutinin-esterase polypeptide (HE), a 

5 nucleocapsid (N) polypeptide, a ORFla polypeptide, a ORFlab polypeptide, a proteolytic 
fragment of a ORFla polypeptide, or a proteolytic fragment of a ORFlab polypeptide. 

3. The polypeptide of claim 1, wherein the polypeptide comprises an amino acid sequence 
selected from the group consisting of SEQ ID NO'': 6039, 7232, 9766, 9767, 9768, 9769, 9770, 
9771, 9772, 9773, 9774, 9775, 9776, 9777, 9778, 9779, 6042, 6043, 6044, 6045, 6046, 6047, 

10 6048, 6049, 6050 or 6052. 

4. The polypeptide of claim 1, wherein the polypeptide comprises an amino acid sequence 
having >75% sequence identity to an amino acid sequence selected from the group consisting of 
SEQ ID NO^: 6042, 6043, 6044, 6045, 6046, 6047, 6048, 6049, 6050, 6052, 9766, 9767, 9768, 

9769, 9770, 9771, 9772, 9773, 9774, 9775, 9776, 9777, 9778, 9779, 9997, 9998, 10149, 10316, 
15 10338, 10339, 10340, 10341, 10342, 10532, 10533, 10571. 10572, 10573, 10574, 10575, 10576, 

10577, 10578, 10579, 11561, 11562, 11618, 11619, 11620, 11627, 11630, 11633 & 11636. 

5. The polypeptide of claim 1, wherein the polypeptide comprises a fragment of at least 10 
consecutive amino acids of an amino acid sequence selected from the group consisting of SEQ 
ID NO^: 6042, 6043, 6044, 6045, 6046, 6047, 6048, 6049, 6050, 6052, 9766, 9767, 9768, 9769, 

20 9770, 9771, 9772, 9773, 9774, 9775, 9776, 9777, 9778, 9779, 9997, 9998, 10149, 10316, 10338, 
10339, 10340, 10341, 10342, 10532, 10533, 10571, 10572, 10573, 10574, 10575, 10576, 10577, 
10578, 10579. 11552, 11561, 11562, 11618, 11619, 11620, 11627, 11630, 11633 & 11636. 

6. A polypeptide comprising an amino acid sequence having >80% sequence identity to an 
amino acid sequence selected from the group consisting of SEQ ID NO^: 6042, 6043, 6044, 

25 6045, 6046, 6047, 6048, 6049, 6050, 6052, 9766, 9767, 9768, 9769, 9770, 9771, 9772, 9773, 
9774, 9775, 9776, 9777, 9778, 9779, 9997, 9998, 10149, 10316, 10338, 10339, 10340, 10341, 
10342, 10532, 10533, 10571, 10572, 10573, 10574, 10575, 10576, 10577, 10578, 10579, 11552, 
11561, 11562, 11618, 11619, 11620. 11627, 11630, 11633 & 11636. 

7. A polypeptide comprising an amino acid sequence that comprises a fragment of at least 
30 10 consecutive amino acids of an amino acid sequence selected from the group consisting SEQ 

ID NO^: 6042, 6043, 6044, 6045, 6046. 6047, 6048, 6049, 6050. 6052, 9766, 9767, 9768, 9769, 

9770, 9771, 9772, 9773, 9774, 9775, 9776, 9777, 9778, 9779, 9997, 9998, 10149, 10316, 10338, 
10339, 10340, 10341, 10342, 10532, 10533, 10571, 10572, 10573, 10574, 10575, 10576, 10577, 
10578, 10579, 11552, 11561, 11562, 11618, 11619, 11620, 11627, 11630, 11633 & 11636. 
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8. A polypeptide comprising an amino acid sequence having >80% sequence identity to 
SEQ ID NO: 6042, and/or comprising an amino acid sequence that comprises a fragment of at 
least 10 consecutive amino acids of SEQ ID NO: 6042, wherein the polypeptide is in the form of 
a trimer. 

5 9. Nucleic acid encoding the polypeptide of any one of claims 1 to 8. 

10. Nucleic acid according to claim 9, comprising a nucleotide sequence selected from the 
group consisting of SEQ ID NO^: 7191, 7273, 7275, 7277, 7279, 7281, 7283, 7285, 7287, 7289. 
7291, 7292, 7293, 9968, 10066, 10084, 10299, 10505, 11323, 11563, 11639 & 11640. 

11. A polynucleotide comprising a nucleotide sequence having >80% sequence identity to 
10 the nucleic acid of claim 9 or claim 10. 

12. A polynucleotide comprising a fragment of at least 10 consecutive nucleotides of the 
nucleic acid of claim 9 or claim 10. 

13. Antibody that recognizes the polypeptide of any one of claim 1 to 8. 

14. The antibody of claim 13, wherein said antibody recognizes the polypeptide comprising 
15 the amino acid sequence of SEQ ID NO: 6042 or a fragment thereof. 

15. The antibody of claim 14, wherein said antibody recognizes the polypeptide comprising 
the amino acid sequence of SEQ ID NO: 6042 or a fragment thereof in trimeric form. 

16. The antibody of claim 13, wherein the antibody is a monoclonal antibody, 

17. The antibody of claim 13, wherein the. antibody is a human antibody, 

20 18. An immunoassay for detecting a SARS virus antigen in a sample, comprising the step of 
contacting the sample with the antibody of any one of claims 13 to 17. 

19. An immunoassay for detecting an antibody against a SARS virus antigen in a sample, 
comprising the step of contacting the sample with the polypeptide of any one of claims 1 to 8. 

20. A method of detecting an antibody against a SARS virus antigen in a sample comprising 
25 contacting said sample with the polypeptide of any one of claims 1 to 8, under conditions 

suitable for binding said polypeptide to said antibody, if present, and detecting the binding of 
said polypeptide to said antibody. 

21 . A method for detecting a SARS virus antigen in a sample comprising contacting said 
sample with the antibody of any one of claims 13 to 17, under conditions suitable for binding 

30 said antibody to said antigen, if present, and detecting the binding of said antibody to said 
antigen. 
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22. A vaccine for the treatment or prevention of severe acute respiratory syndrome (SARS), 
comprising an inactivated SARS virus, a kiUed SARS virus, an attenuated SARS virus, a split 
SARS vims preparation, or at least one purified SARS virus antigens. 

23. The vaccine of claim 22, comprising a purified polypeptide according to any one of 
claims 1 to 8. 

24. The vaccine of claim 22 or claim 23. wherein the antigen is a purified SARS virus 
antigen in the f omi of a VLP. 

25. The vaccine of any one of claims 22 to 24, fijrther comprising an adjuvant. 

26. The vaccine of claim 25, wherein the adjuvant is an aluminium salt or is MF59. 

27. The vaccine of any one of claims 22 to 26, comprising more than one SARS virus 
antigen. 

28. The vaccine of claim 27, wherein the antigens are selected from S, E, N and M. 

29. The vaccine of claim 22, comprising an inactivated SARS virus. 

30. The vaccine of claim 29, wherein said virus is inactivated by chemical or physical means. 
The vaccine of claim 30, wherein said inactivation comprises treatment of the virus with 

an effective amount of one or more of the foUowing agents selected from the group consisting of 
detergents, formaldehyde, formalin, P-propriolactone, and UV light. 

32. The vaccine of claim 30, wherein said inactivation comprises treatment of the virus with 
an effective amount of one or more of the following agents selected from the group consisting of 
methylene blue, psoralen and carboxyfullerene (C60). 

33. The vaccine of claim 30. wherein said inactivation comprises treatment of the virus with 
an effective amount of one or more of the foUowing agents selected from the group consisting of 
binary ethylamine, acetyl ethyleneimine and gamma irradiation. 

34. The vaccine of claim 31, wherein said inactivation comprises treatment with P- 
propriolactone. 

35. The vaccine of claim 34, wherein said P-propriolactone is used at a concentration of 0 01 
to 0.5%. 

36. The vaccine of claim 34. wherem said P-propriolactone is used at a concentration of 0 5 
to 0.2%. 

37. The vaccine of claim 34, wherein said P-propriolactone is used at a concentration of 
0.025 to 0.1%. 
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38. A method of inactivating SARS virus comprising exposing the viras to an inactivation 
agent for 12 to 24 hours at refrigeration temperatures followed hydrolysis of any residual 
inactivating agent by elevating the temperature for three hours. 

39. The method of claim 38, wherein the inactivation agent is p-prbpriolactone, 

40. The method of claim 38, wherein the refrigeration temperature is between 0°C and 8"C. 

41 . The method of claim 38, wherein the elevated temperature is between SyC and 41*'C. 

42. A method for making an inactivated SARS vaccine comprising: 

a. innoculating a mammalian cell culture with SARS virus; 

b. cultivating the infected cells; 

c. harvesting SARS virus containing supernatant; 

d. inactivating the SARS virus; and 

a. purifying the inactivated SARS virus. 

43. The method of claim 42, wherein said mammalian cell culture is derived from one or 
more of the cell types selected from the group consisting of fibroblast cells, endotheHal cells, 
hepatocytes, keratinocytes, immune cells, mammary cells, smooth muscle cells, melanocyte 
cells, neural cells, prostate cells, renal cells, skeletal cells, liver cells, retinoblast cells and 
stromal cells. 

44. The method of claim 42, wherein said manunalian ceU culture is derived from a cell 
culture selected from the group consisting of human cells, non-human primate cells, HeLa cells, 
human diploid cells, fetal rhesus lung cells, human embryonic kidney cells, VERO cells, hoise 
cells, cow cells, sheep cells, dog cells, cat cells or rodent cells. 

45. The method of claim 42, wherein said mammalian cell culture is derived from VERO 
cells or fetal rhesus kidney cells. 

46. The method of claim 42, wherein said manmialian cells are cultured in serum free media. 

47. The method of claim 42, wherein said mammalian cells are cultured in protein free 
media. 

48. The method of claim 42, wherein said inoculating step comprising absorbing the SARS 
virus onto the cell culture for 60 to 300 minutes. 

49. The method of claim 42, wherein said inoculating step is conducted at 25®C to 40'*C. 

50. The method of claim 42, wherein said purification step comprises one or more of the 
treatments selected from the group consisting of gradient centrifugation, ultracentrifugation, 
continuous-flow ultracentrifugation, chromatography, polyethylene glycol precipitation, and 
ammonium sulfate precipitation. 
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51. The method of claim 42, wherein said purification step comprises one or more of the 
treatments selected from the group consisting of ultrafiltration and dialfiltration. 

52. The method of claim 50, wherein said chromatography treatment includes one or more of 
the chromatography treatments selected from the group consisting of ion exchange 
chromatography, size exclusion chromatography, and liquid affinity chromatography. 

53. The method of claim 52, wherein said chromatography treatment includes use of one 
more chromatographic resins selected from the group consisting of an an anionic resin and a 
cationic resin. 

54. The method of claim 52, wherein the ion exchange chromatography treatment includes a 
first step using a strong anion exchange resin and a second step using a strong cation exchange 
resin. 

55. The method of claim 50, wherein said gradient centrifugation purification step comprises 
density gradient centrifugation. 

56. The method of claim 42, wherein said purification step comprises a first step of 
chromatography purification and a second step of gradient centrifugation. 

57. The method of claim 56, wherein said first chromatography purification step comprises 
liquid affinity chromatography. 

58. The method of claim 56, wherein said second gradient centrifugation step comprises 
density gradient centrifugation. 

59. A single-stranded oligonucleotide comprising a nucleotide sequence selected from the 
group consisting of SEQ ID NOS: 21-6020, 6076-6568, 6586-6587, 7292-7301, 7325-7328, 
7332-7352, 7353-7385. 10235-10298, 10352-10504, 10580-11322 and 11325-11551. 

60. A single-stranded oligonucleotide comprising the complement of the oHgonucleotide of 
claim 59. 

61. The oligonucleotide of claim 59 or claim 60, comprising 10-30 nucleotides. 

62. The oligonucleotide of claim 61, comprising the nucleotide sequence of SEQ ID NO: 
7292, SEQ ID NO: 7293, the complement of SEQ ID NO: 7292 or the complement of SEQ ID 
NO: 7293. 

63. A kit comprising primers for amplifying a template sequence contained within a SARS 
vims nucleic acid target, the kit comprising a first primer and a second primer, wherein the first 
primer comprises a sequence substantially complementary to a portion of said template sequence 
and the second primer comprises a sequence substantially complementary to a portion of the 
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complement of said template sequence, wherein the sequences withm said primers which have 
substantial complementarity define the termini of the template sequence to be amplified. 

64. The kit of claim 63, wherein the template sequence is contained within SEQ ID NO: 1 
and/or SEQ ID NO: 2. 

65. The kit of claim 63 or claim 64, wherein the first primer comprises a ftagment of 8 or 
more nucleotides of SEQ ID NO: 1, and the second primer comprises a fi-agment of 8 or more 
nucleotides of the complement of SEQ ID NO: 1. _ - 

66. The kit of claim 63 or claim 64, wherein the first primer comprises a fragment of 8 or 
more nucleotides of SEQ ID NO: 2, and the second primer comprises a fragment of 8 or more 
nucleotides of the complement of SEQ ID NO: 2. 

67. The kit of claim 63, wherein the first primer is an oligonucleotide according to any one of 
claims 59 to 62 and the second primer is an oligonucleotide according to any of claims 59 to 62. 

68. The kit of any one of claims 63 to 67, further comprising a labeled probe that comprises 
either a fragment of 8 or more nucleotides of SEQ ID NO: 1 and/or SEQ ID NO: 2, or the- 
complement of said fragment, which fra^ent is located within the template sequence. , 

69. .. The kit of any one of claims 63 to 68, wherein the first primer and/or the second primer 
comprises a nucleotide sequence selected from the group consisting of SEQ ID NOS: 21-6020, 
6076-6568, 6586-6587, 7292-7301, 7325-7328, 7332-7352, 7353-7385, 10235-10298. 10352- 
10504, 10580-11322 and 11325-11551. 

70. The kit of any one of claims 63 to 68, wherein the first primer and/or the second primer 
comprises the complement of a nucleotide sequence selected from the group consisting of SEQ 
ID NOS: 21-6020, 6076-6568, 6586-6587, 7292-7301, 7325-7328, 7332-7352. 7353-7385, 
10235-10298. 10352-10504, 10580-11322 and 11325-11551. 

71 . A method of detecting the presence of S ARS virus in a sample comprising providing a 
sample suspected of containing a SARS vims nucleic acid target, amplifying a template sequence 
contained within said SARS virus nucleic acid target with the kit of any one of claims 63 to 70, 
and detecting the amplified template sequence, wherein the presence of the amplified template 
sequence indicates the presence of SARS virus in said sample. 

72. The method of claim 71 , wherein said amplifying is accomplished using polymerase 
chain reaction, transcription mediated amplification, reverse transcription PGR, ligase chain 
reaction, strand displacement amplification or nucleic acid sequence-based amplification. 

73. A double-stranded RNA molecule with a length from about 10 to about 30 nucleotides 
which is able to inactivate the SARS coronavirus in a mammalian cell. 
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74. The double-stranded RNA of claim 73, wherein the sequence of one of the strands is at 
least 90% identical to a target sequence, wherein the target sequence is a fragment of SEQ ID 
NO: 1 and/or SEQ ID NO: 2. 

75. The double-stranded RNA of claim 73 or claim 74, wherein the target sequence 

5 comprising a nucleotide sequence selected from the group consisting of SEQ ID NOS: 7292, 
7293, 7294. 7295, 7296, 7297, 7298, 7299, 7300 and 7301. 

76. The double-stranded RNA of any one of claims 73 to 75, comprising at least one^ 
modified nucleotide. 

77. A method for treating a patient suffering from SARS, comprising: administering to the 
10 patient a therapeutically effective dose of a molecule of less than 1000 g/moL 

78. The method of claim 77, wherein the molecule has an aromatic region and greater than 
one heteroatom selected from O, S, or N. 

79. A method for treating a patient suffering from SARS, comprising: administering to the 
patient a therapeutically effective dose of a compound selected from: a nucleoside analog, a 

15 peptoid, an oUgopeptide, a polypeptide a protease inhibitor, a 3C-like protease inhibitor , a 
papain-like prqtease inhibitor, or an inhibitor of an RNA dependent RNA polymerase. 

80. A method for treating a patient suffering from SARS , comprising: administering to the 
patient a steroidal anti-inflammatory drug in combination with at least one antiviral compound. 

81. A method for treating a patient suffering from SARS, comprising: administering to the 
20 patient a therapeutically effective dose of a compound selected from: acyclovir, gancyclovir, 

vidarabidine, foscamet, cidofovir, amantidine, ribavirin, trifluorothymidine, zidovudine^ 
didanosine, zalcitabine, an antiviral compound listed in Table 1; an antiviral compound Ksted in 
Table 2; or an interferon. 

82. The method of claim 81, wherein the interferon is an interferon-a or an interferon-p. 
Z5 83. The method of any one of claims 77 to 82, wherein the molecule or compound is 

delivered by inhalation. 

84. A method of identifying a therapeutically active agent comprising the steps of: (a) 
contacting a therapeutically active agent with a cell infected with the SARS viras; (b) measuring 
attenuation of a SARS related enzyme. 
JO 85. A viral vector or particle for in vivo delivery of a nucleic acid of claim 9 or claim 10. 

86. The viral vector of claim 85, wherein the vector is an adenovirus vector, a poxvirus 
vector or an alphavirus vector. 

87. An alphavirus replicon particle comprising one or more SARS viral antigens. 
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88. The replicon particle of claim 87, wherein said SARS viral antigen is a spike protein. 

89. The rephcon particle of claim 87, wherein said particle comprises a replicon derived from 
Venezuelan Equine Encephalitis (VEE) and further comprises an envelope derived from Sindbus 
virus (SIN) or Semliki Forest Virus (SFV). 

90. A vaqcine comprising one or more SARS virus antigens and one or more respiratory 
virus antigens. 

91. The vaccine of claim 90, wherein said respiratory virus antigens are selected from the 
group consisting of influenza virus, human rhinovirus (HRV), parainfluenza virus (PIV), 
respiratory syncytial virus (RSV), adenovirus, metapneumovirus, and rhinovirus. 

92. The vaccine of claim 91, wherein said respiratory vims antigen is from influenza virus. 

93. The vaccine of claim 90, wherein said respiratory virus antigen is fixim a coronavirus 
other than the SARS virus. 

94. A polypeptide comprising an immunogenic, surface exposed fragment of the amino acid 
sequence SEQ ID NO: 6042. 

95. The polypeptide of claim 94, wherein said fragment does not include the last 50 amino 
acids of the C-terminus of SEQ ID NO: 6042. 

96. The polypeptide of claim 94, wherein said fragment does not include a transdomain 
region of SEQ ID NO: 6042. 

97. The polypeptide of claim 94, wherein said fragment does not include a C-terminus 
cytoplasmic domain of SEQ ID NO: 6042. 

98. The polypeptide of claim 94, wherein said fragment does not include a N-terminus signal 
sequence. 

99. An isolated polynucleotide comprising a nucleic acid sequence selected from the group 

consisting of SEQ ID NOS: 9968 and 10066. 

100. The polynucleotide of claim 99, wherein the polynucleotide comprising a nucleic acid 
sequence having > 80% sequence identity to a polynucleotide sequence selected from the group 
consisting of SEQ ID NOS: 9968 and 10066. 

101 . An isolated polynucleotide comprising a fragment of at least 15 conseicutive nucleic acids 
of a nucleic acid sequence selected from the group consisting of SEQ ID NOS: 9968 and 10066 
and wherein said fragment does not consist entirely of SEQ ID NO: 10033. 

102. An isolated polypeptide comprising an amino acid sequence encoded by any one of 
claims 99-101. 
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103. The polypeptide of claim 102, comprising an amino acid sequence selected from the 
group consisting of SEQ ID NOS: 9969 - 10032, 10067, and 10015. 

104. The polypeptide of claim 103, wherein the amino acid sequence is selected fiom the 
group consisting of SEQ ID NOS: 9997, 9998 and 10015. 

105. An expression construct for recombinant expression of a SARS virus spike protein 
wherein said construct comprises a nucleic acid sequence selected from the group consisting of 
SEQ ID NOS: 6578 - 6583. 

106. A mammalian cell line stably expressing a SARS viral antigen. 

107. The cell line of claim 106, wherein said cell line is a Chinese Hamster Ovary (CHO) cell. 

108. The ceU line of claim 106, wherein the SARS viral antigen is a spike protein or fragment 
thereof. 

109. The cell line of claim 106, wherein the spike protein is tiimcated to remove the 
transmembrane sequence. 

1 10. A method of identifying a therapeutically active agent comprising the steps of: (a) 
contacting a therapeutically active agent witii a buffer comprising SARS enzyme; and (b) 
measuring attenuation of the SARS enzyme. 

1 1 1. The method of claim 110 wherein the SARS enzyme is a SARS protease. 

112. The metiiod of claim 111 wherein tiie buffer further comprises a peptide witii a SARS 
protease cleave site. 

1 13. The metiiod of claim 110 wherein the measurement is made by the measurement of 
fluorescence. 

114. A vaccine of one of claims 22 to 37, and 90 to 93 further comprising an adjuvant. 

115. The vaccine of claim 1 14 wherein the adjuvant is a SMIP. 

1 16. The vaccine of claim 1 15 wherein the SMIP compound is selected from tiie group 
consisting of an acylpiperazine, a ti^ptantiirin, an indoledione, a teti^ydroisoquinoline, a 
benzocyclodione, an amino azavinyl compound, a thiosemicarbazone, a lactam, an 
aminobenzimidazole quinolinone, a hydropthalamide, a benzophenone, an isoxazole, a sterol, a 
quinazolinone, a pyrole, an antiiraquinone, a quinoxaline, a tiiazine, an benzazole, and a 
pyrazolopyrimidine, or a pharmaceutically acceptable salt, ester, or prodrug tiiereof. 

1 17. A method of vaccinating a subject comprising administering a vaccine of one of claims 
22 to 37, and 90 to 93. 

118. The method of claim 117 further comprising administering a SMIP. 



-643- 



wo 2004/092360 



PCTAJS2004/011710 



1 19. A method for treating a patient of one of claims 77 to 82 further comprising 
administering at least one SMDDP compound. 

120. A method for treating a patient of one of claims 77 to 82 further comprising 
administering at least one SMIS compound. 
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(990) tttttctttggatctagattaga tt ggccaaaq agaat 

(1033) 1Q 33 .1040 ,1050 .1060 ^^''^'''"lOTS 




BCV N 
MHVN 

Avian infectious bronchitis N 
Consensus 



BCVN 
MHVN 

A\dan Infectious bronchitis N 
Consensus 



0076) 1076 ^1090 ^jjoo 

(1049) T^aAT!n*&^fe%HigGte 



T7!Tr~rri ~ section 27 

^1^^9)1119 ^30 .1150 1161 



BCVN 
MHVN 

Avian infecUous bronchitis N 
Consensus 



(1095) ^tt^t^gagaccata 'm^3Qt^^m>mi:Mi^mk^m«pm^' 

(945) ^^GTA^AAi^TTGTGj^g^l^lliSg?^ 
(1119) TTTTGAGAC AT ATGAA aTlTTGAATGA GAf TTTgIa 



talks - TGA!-;bC€C&A= 

(1162) TGC TACCAA AACAAGGTGGTA GATGAA TOA GCCctS 
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FIGURE 3A (contd.) 
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(1205) 1205 .1210 
BCVN(il75) 
MHVN{1172) 
Avian infectious bronchitis N (1028) 
Consensus (1205) 



,1220 



.1230 



Section 29 
1247 



(1248) 
BCVN(1211) 
IWHVN(1214) 
Avian infectious broncNtis N (1068) 
Consensus (1248) 



(1291) 
BCV N (1254) 
MHV N (1257) 
Avian infectious bnonchiOs N (1 11 1 ) 
s(129i) 



r- , I. ,— 1 ~~~ — — l^fn 

AAps;%c;A!G©fe^nc^- - -pi^pgirg^ iAidA^&TGjsa^S 

A CCACAACGT A GCGTGGACAGG T A GAA GACAAG 

' — _ SecHon 30 

•'248 .1260 .1270 .1280 1290 

SCASCi|5GCG:TC:?.iAiGi^^GAGAA^iGTfeA^^^^ 

A GAAA GATAATATA&G GTTGCAA GCC AAAAGC 6TGT 

■ — • Section 31 

1291 ,1300 ,1310 .1320 1333 



(1334) 
BCV N (1294) 
MHV N (1297) 
Avian infectious bronchitis N (1150) 
Consensus (1334) 



S^ATSAgGTiG^^O' pCiit3^ip^T;fe^:|^|iiTaA^f 

GCAGCAAAATATAAGTAGAGAATTGACC CAGA 6GA AG 

• Section 32 

1334 .1340 .1350 ,1360 1376 



liAgAAfGCACAq^iG^Tf 
AGCCTT T TCA6AA TTOATQATG 



(1377) 1 
BCV N (1325) ( 
MHV N (1340) C 
Avian Infectious bronchitis N (1193) l 
Consensus (1377) 



Gp^ijCTiS^gS 

>CGTAQT||ipA(^il!G 
-lCCCAA^GTQAT^PJ 
GCC ATA 
Section 33 




GAAGAT CCTCA ATGTATAA 



BCVN 
IV1HVN 

Avian infectious bronchitis N 



SEQ ID NO: 9883 
SEQ ID NO: 9894 
SEQ ID NO: 9903 
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FIGURE 3B 



HOBMPRO 
BCVM 
MHVM 

Avian infectious brochiiis virus M 
Consensus 



HOBMPRO (42) li^S^JgaTA 

BCV M (10) GTii:^e'I>J 

MHVM (10)|ccF^^" 

An/ian fnfecGous brochitis vims M (1 ) -^'^ ^ ^.^ 

Consensus (42) a aactc 



— — Secflon 1 

(1) 1 J[0 go 30 4i 

(1) GATGTGGATGACGTTTAGGTAATCCAAACATTAI^SAG^Tf 

(t) IIIIIIIIIIIIIII""""! ^'i^k^"^ 

^"^^ ATGAGTAGT 

Section 2 

(42) 42 50 .60 ^o 82 



HOBMPRO 



(83) 83 



CTCCAGCGCCAGTTTATCTCTGGACTGCTGA 

— — Section 3 

.110 123 



Consensus (83) TGA GGCTATTAAATTCgTTAAGGAATQGAATTTTTCTTTGG 

~ — — — Section 4 



(124) 124 



HOBMPRO (121) SI 
BCVM (89)|§' 

Consensus (124) gtatt atactactttttattactatcat.^^ 

— — — Section 5 

_.180 .190 205 



(165) 165 ,170 



■HOBMPRO (162) 'j^AwmB^^j^^^M^^M^mm^^^mmS 

Avian infecuous b«K:hlUs virus M (118) ^^^Si^AlA^Gi^l^lig 

cons ensus (165) tatacaagt cgcagtatgtttgtttatgttattaagat 

' ~ — Sectbn 6 

230 246 



(206) 206 
HOBMPRO (; ~~ 
BCV^ 

MHV M (174) : 



Consensus (206) aATTTTGTdGCTTATQTQGcfeoTSSTMiGTHT?t^^^ 
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FIGURE 3B (conid.) 




Avian Infectious brochitisvlais M (200) ^AAirp^CA^TA^'^iii^icWcCA* 

Consensus (247) TTTTTAATTGCGTATATGCGTTGAATiATGTGTATCT^^ 



- Section 8 

(288) 288 300 310 ^ 328 

HOBMPRO(285) C'I},TT.CT?k:rlKi&T^iliTTACC'^TAG^F'<3dcC 
BCV M (253) S>.3:T'G7Apr|i«i!3T.>E;^^:ciiOT 
, , . MHV M (256) i|filCi#i?#3irpg^ 

Avian mfecUous brocWtis virus M (241) GCAGlGG|i^^A*'AC!^^%5(3iAQa^^ 

Consensus (288) ttttctatagtttttactataotogccattat atgtqgat 

-— — — Section 9 

(329) ^9 J40 350 



Avian Infectious brochltis Vims ^ - , 

Consensus (329) tgtgtat t t t gt i^aatagt atc aggttgt^^^ 

— _____ Section 10 

(370) 370 380 390 .400 410 

HOBiVIPRO (367) GA^AG-g^TTTGG^GTTJDCAAdteblsrGA?^ 
BCV M (335) ^^p)E^iii;ESdiSi^«|^a^^ 
^, . MHVIVI(338) GCXtoGG^]^G:^G'ci'e^M^ 

Avian infectious brochitis virus IVI (323) GGCAATgj^^i^yGCAi^T^ps^^ -^I^ci 
Consensus (370) gaagttggtggagtttcaacccagaaacaaacaacttg^ 

— — Section 11 

(41 1 ) 411 , " 430 i440 451 

HOBiwpRo (408) ^^$^p,^^!^c^^^^mf^^m^?^^mwxsM 

BCV M (376) 'SC'JlKXh^?^'R1^^^^^G<$'4K'^QG%^:<i^^ 
ly^HVIW (379) TaT>iTA.GATATGAA^d^>VCT|G3P^,T>^^^ 
s virus M 1» ■A<4nt'f!_sHr'^=i!i&T«ai4T«V^r?!^^ 



Avian infectious brochitis virus M (359) t'- - - -i^^Q*--'5d?CSiil^TAC^^ 

Consensus (411) tgtatagatatgaaaggtactatgtatgttaggccga^ 

Section 12 



(452) 452 

HOBMPRO 
BCV 
IWHV 



Avian infectious bnaciiltis vims M (392) G'sKiCi'^^^^'^m^'S':^- '--^^^ 
Consensus (452) tgagga'ttaccatacactgacggtcacaataatacgtg^^^ 



BNSDOCIO: <WO__20040923eCIA2J_> 
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FIGURE 3B (contd.) 
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13 



(493)493 .500 sio 500 coo 

Consensus (493) atctttacatggaaggt ataaagctaqgtactggcSttcI 

(534) 534 .540 550 ^ ^^''*'°"Ji 

Consensus (534) TTGTCTGATT TacC^G&TTATGTGAcf 1 




HOBMPRO (613) 
BCyM(581) 



consensus (616) GCGATACTAGTGGTTTTGCTGT TTATGTTASf TCCAiAGTg 

(657) 657 ~ Section 17 

Consensus (657) ggtaattaccgactgccatcaa ccca^aagIgtt ctgScIt 

(698) ggS 710 ^ Section 18 



Avian infectrous b«,chitis ^ims S (eS^} ^r?????????? 

Consensus (698) Qgacaccgcattgttgagaaataatatctaa " 



(739) 

HOBMPRO (736) TG SEQ id nO: 9912 

BCVM(694) -- SEQ ID NO: 9884 

MHVM{688) -- SEQ ID NO: 9895 

Avian infecUous brochilis virus M (601) - - seq id no: 9904 
Consensus (739) 



- Section 19 
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FIGURE 3C 

, , — — — Section 1 

{1)1 JO ^ Jiy .30 .40 • 53 

HOBHEGA (1) g^^Ai^A^qiiAQ^ 
BCVHE . (1) ^J^fi^Piit^li^^ 

MHv HE (1) - - - - - - - - - - ^ . _ i - : : ; 

Consensus (1) CTAAACTCAGTGAAAATGTTTTTGCTTC TAGATTT TTCTAGTTAGCTGCAT 

: Section 2 

106 



HOBHEGA 
BCVHE 
MHV HE 

Consensus 



(54) 



go 



HOBHEGA 
BCVHE 
MHV HE 

Consensus 



{54) ^^i^^gtfGp\^t:T|Ci'lj|^^^ 

(54) g?i-^'i|p^iGjscafj^"^^ipaSQ^^ 

(1) - - ----- - ----- - - - -------- - - --- ---- 

(54) AATTGGTAGC TAGGTTTT ACAA CCTCCTACCAATOTTGTTTCGCAT TAA 

II. Section 3 

(107) 107 .120 • .130 J40 159 

(107) ivl' ' 
(107) ^ib| 
(1)"--- 

(107) ATGGAGATTGGTTTTTATTTGGTGACAGTCGTTCAGATTGTAATCAT TTGTT 
— Section 4 



(160) 160 
HOBHEGA (160)^^ 
BCVHE (160) J 
MHV HE (1) ■ 

Consensus (160) A ta caaccccc taattattcttatatggaccttaatcctg 



TGTGTG 
— Section 5 



HOBHEGA 
BCVHE 
MHV HE 

Consensus 



(213) 
(213) 
(213) 
(1) 
(213) 



ttctggtaaaatatcatctaaagctggcaactccatttttaggagttttcact 
■ Section 6 



HOBHEGA 
BCVHE 
MHV HE 

Consensus 



(266) 
(266) 
(266) 
(1) 
(266) 



ttaccgatttttataattacacaggcgaaggtcaacaaattatttt tatgag 
Section 7 



(319) '< 
(319) t 
(319) ^ 

(1) - 

(319) ggtgttaattttacqccttatcatgcctttaaatgca c ttctqgtagtaa 



HOBHEGA 
BCVHE 
MHV HE 
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FiGURE 3C (contd.) 



HOBHEGA 
BCVHE 
MHVHE 

Consensus 



(372) 372_ 



390 



Section 8 
424 



~~ ' Sections 




HOBHEGA 
BCVHE 
MHVHE 

Consensus 



(478) 478 .490_ ^00 ■ 510 520 

~~ — Section 1 1 



HOBHEGA 
BCV HE 
MHVHE 

Consensus 



(531) 531 



, , , -^0 ,,55" , ,560 .570 583 



(531) ^ 

(S31) TGCATATATAGCTC TGAAGCTAATT TGGGGATTATTATiATAAGGTTGfSG 
(584)584 ^ — — Section 12 



HOBHEGA 
BCVHE 
MHVHE 

Consensus 



HOBHEGA 
BCVHE 
MHV HE 

Consensus 



(690) 690 ^ ~ — —Section 14 

(675) iTj/^G^f 5; 



?G AGAGTtmGm* A J\ TA-'f f3ra S.^?^^ 



(690) TAGTCAATATTATTTTAATAAAGACACTGGTGTTATTTATGGTCTcSSIIcTt 
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FIGURE 3C (conid.) 



. . Section 1 5 

(743) 743 .750 ^Jl^ 770 780 795 

HOBHEGA (728) ^ - AGiii^G€^^li51e;q^i. t3>G<3^Fp,TTG>TCTTAATTGT!TAi^ 

BCV HE (728) Q ^^^(^k^QC^^'C^cisiJti 6:''^j3^j'i|^t?i?^:&cjc'r;^^^^ 

MHV HE (362) qCTTGGATGTiiiGC^^iiitC^G]ejT|t^ 

Consensus (743) c tgaaaccattacca ctggttttgatcttaattgt at 

. '. ; Section 16 

(796) 796 
HOBHEGA (766) 

BCVHE (766) . , .. 

MHV HE (416) f d|«C^TiCA|-TGAiC.TC^*0iGfe*5v^^ 

Consensus (796) tatttagttttaccctctggtaattatttagccatttcaaatgagctattgtt 

-■ Section 17 




(849) 849 



HOBHEGA (819) l|il^piS^CG^il^Q|^^^!^p|^i 

BCVHE (819) A^£!T/ST.TiGe^^CTAA,^jaG^^^ _ . 
MHV HE (468) p^eGCTi^lJlAi^aCTOAATpG^Ti^^ 
Consensus (849) AACTOTTCCTAC AAAGCAATCTGTCTTAATAAGC6TAA6QATTTTACGCCTG 
■: Section 18 



(902) 

HOBHEGA (872) 
BCVHE (872) 
MHV HE (621) 

Consensus (902) 

(955) 

HOBHEGA (925) 
BCVHE (925) 
MHV HE (574) 

Consensus (955) 



902 ,910 §20 ?30 . ^ _ ^40 954 

|FG^|SL(?'^'3PA^"?r"g?.^^C'§.C|AA,^:^^^^ 

T ACAGQ TTGTTGAT T CGCGGTGQAAC7LAT G CC AGO C AOT CT6ATAAC ATG ACQ 

Section 19 

955 J960 970 980 .990 1007 

66 QTTQCTTGTCAACC CCGTACTGTTATTTTCGTiiJVTTCTACTACCAATTA 

Section 20 

,1050 1060 



(1008) 1008 ,102 0 ,10 

HOBHEGA (978) T^TTGeTGliTji* iiT,i^V^^^%^^ 

BCVHE (978)^'^^-'^^-"— ifiA^-^i^^i: 
MHV HE (627) 
Consensus (1008) 



.1040 



*AG.^<3<^T.qGCACAC3iTi!3;?i^QCO 



) TGTTGGTGTTT 



iGGCAGTOF^T 

ATGATAT AATCATOG GATGCTGGTTTTACTAGCATAC 

Section 21 
1113 



.1080 



.1090 



.1100 



(1061) ~ ■■— ^ , 

HOBHEGA (1028) ^^^^r^d^Tm^^^^'^S^^^^^ 

MHV HE (680)^ 
Consensus (1061) ^ 



) T AGTGGTTTGTTATATAATTCACCTTGTTTTTCGCAGCAAGGTGTTTTTAGG 



BNSDOCID: <WO__^004092360A2J_> 
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FIGURE 3C (contd,) 



/L120 



1130 



.1140 



^ Section 22 

116S 



(1114) 1114 
HOBHEGA(1081) 

Consensus (1114) TATGATAATGTTAGCAGTGTCTGG CCTCTCTACCC StGGCAGATGTGC AC 

— — ~ SecUon 23 

(1167)1167^^^ .1180 _.1190 1200 191q 

MHVHE (786 Qf5g^©!Sl?feAi^C^®GG^T&- - -feiATG^^<ja^ 
Consensus (1167) TGCTGCTGATATTAATAACCCTGA TGTACCTATTTGT^ 

~~ " '■ ■ '. .Qoofinn OA 



HOBHEGA 
BCVHE 
MHVHE 

Consensus 



(1220) 1220 



1230 



HOBHEGA 
BCVHE 

MHvne 

Consensus 



^Ct^O)^CAGTTATTTa^^ 

;j240) i»iipi3«p5j^6Mf^^ 

(889) ^^t^^KTiaf II.M'^cg^PS^ 



(1273) TTGTTGTTATATTT TATGGTGGATAATQGTACTrGGCTGc£TlAT^^^^ 

— Section 26 



HOBHEGA 
BCVHE 
MHVHE 

Consensus 



(1326) 132g 



1337 



(1293) e^TkAT'e'ItSAAdi SEQ id NO: 9913 

(1293) i;c^q?A^i^9AAA|! SEQ ID NO: 9885 

(940) SEQ ID NO: 9896 
(1326) CATAATCTAAAC 
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FIGURE 4 

FIGURE 4A 



avian infectious bronchitis pol 1ab 
bovine coronaviais pol lab 
Hunian corona 229E pol lab 
Murine hepatitis pol lab 
Consensus 



avian Infectious bronchitis pot lab 
bovine coronavirus pol lab 
Human corona 229E pol lab 
Murine hepatitis pol lab 
Consensus 



— — — — section 1 

(1) 1 JO go 39 

(1) . --zrrzTi-~rrrrirriri-~: 

(1) SiN|^pBLHi«illiiFEDlEiiiD»iipvilV 

(1) ^^^ACltfR VT L A V A sMIe 1 S AN G 

(1) i^^G^lGFKitl^i^I.P|i;Sf^Gt#iR|iiEiGF 
(1)MAKI KYGL WAPEPPWJid NA EKL NPDSSE D 

— Seotion 2 

(40) 40L„ ^ JO 78 

(1) 



avian infectious bronchitis pol lab 
bovine coronavirus pol lab 
liuman corona 229E pol 1ab 
IMurine hepatitis pol lab 
Consensus 




(79) ^fflnTlilYD^LLQDiS^f E^^TiPLGigiEiCY 

(60) T YgpGLHGfiG'PL FCN^MK FS DlS^P Fpl^HG - WLVfil 

(79) SPiDEDiiKiiAST^feeiFGsglclK^fSKRliiQi^ 
(79) DIVM PN LE IMALQ R AVIi~V P LSI AFS 
\ — — . — Section 4 



avian infectious bronchitis pol lab 
bovine coronavirus pol 1ab 
Human corona 229E pol lab 
Murine hepatitis pol 1ab 
Consensus 



avian infectious bronchitis pol lab 
bovine coronavirus pol 1ab 
Human corona 22gE pol lab 
Murine hepatitis pol 1ab 
Consensus 



avian infectious bronchitis pol 1ab 
bovine coronavirus pol 1ab 
Human corona 229E pol 1ab 
Murine hepatitis pol 1ab 
Consensus 



(118). 118 ,13g_ 1S6 

(1 ) iAS||LKQ.^is Pi— — pi^DVi ipS^I 

(118) VRfCNiNGiTfi:^^fes^^|ia^R^^Si^NKSt^^ 

(93)|i;SNYlLEEjD^FG#i; ^GGSiT YT DQ^CG A®G 

(118) iLfviiK^Ai^^cgp^Ei^CDAiPli^^TVQ- 
(118)1)1 G LP SF MGIiFKR LCNTG CAV HVAYLLF D 

— -— Section 5 

(157) 157 J70 ,180 J95 

(24) ii 
(157)1 

(126) 1 _ _ 

(167) ^i^LpMM^^Ti^Y^f^Q^sSL^lG 
(157) PDGVCDGLGQPVGWFIPL AIPINARQFI PWLI LKK 

■■■ . SecOon 6 

(196) 196 210 ^220 234 



(63) K|!ET — ^C--GlFpLKpvSKillTPpjVP^«iV 

(1 96) §E!KGi- A Y N K D H^|g F E h|PjIf K^|S^Yg^i^Eif!!|l^i§ 

(165) b§p^ Q Nk^|iE?iiiEYSVlHGi>^^ 

(196) ^Nig'^S VT S GH FM'iCvTpPpflDFW^^gt^C^E^EiJiH^^ 
{196)GFKG k'rA M VYHL VEDA DLVH DAP KGKF 
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FIGURE 4A (contd,) 




avfan InfecQous bronchitis pot lab 
bovine coronaviais pol lab 
Human corona 229E poi lab 
Murine hepatifis pot lab 
Consensus 



... „ 290 .300 



(274) ^ _ 

(274) D I T L AE T K DL F P I'w S D 



avian Infectious bronchitis pol 1ab 
bovine coronavirus pol 1ab 
Human corona 229E pol 1ab 
Murine hepatitis pol 1 ab 
Consensus 



(313) 313 



DV VAWHVDRDPRKaMRio 

Section 9 

330 .340 351 



avian Infectious bronchitis pol lab 
bovine coronavirus pol 1ab 
Human corona 229E pol 1ab 
Murine hepatitis pol lab 
Consensus 



avian infectious bronchitis pol lab 
bovine coronavirus pol lab 
Human corona 229E pot 1ab 
Mifllne hepatitis pol lab 
Consensus 



166) fPf^^MDAiGS^S AL PQ^KQQ^^I FQi8Al?llPENSgN 

(312) |A'6=^$I^S.>ni«^ANJFiVCD^^"^ — 
(268) ^liNKI^Mfp gMc P G%i,f i tUqqIIg AG^Y 

(313) VIATVRCVLYVNQPTEDLVDGSVVAREPIKLLAA SIV 

— Section 10 

(352) 352_._.^ J60 37 0 380 390 

(205) .E#^RgA A^KM A||AKCiR sir^V^E^T LVVKEFAGTCL 

(351) f^jQH8P.^||^SCFteMDipjNAPiiGVDgK^ 

S2«) SM. — ^---^«*'«^SQsiDCFVAgi^ 

(352) R§f B TMLYT D^J|t B P C^KT-^Cp^^gb^i^ 

(352) RLPN I IL FIBAAiSVI VI YL KL d'^CGFISQFGY 

Section 1 1 
429 



(391) 391 



_4p0 



390) i|pEQiM?Mig|p||^to^ 
336 S|KVHR^iCFlS|fts|^iBCRPvM^^ 
(388) ^^iiCfeX'iy^^^iA^J^^i^Ppjri?GCTKNS^^^ 
(391) IDC GDLCDFKGWVPNNMMDGF CTTLG YESAVRLAQ 
SecCon 12 



avian infectious bronchitis pol 1ab 
bovine coronavirus pol 1ab 
Human corona 229E pol lab 
Murine hepatitis pol lab 
Consensus 



(430) 430 



,440 



/450 



KpT QVVV RG MgNDLT L^D 

(375) ?^g^|;|p||S|l|G DiYDjE?!!!^ -rTZL -ik- P W fI^'R^ 

(427) i^Sj3^p;KEj0;G;tea^ KlCMll'^lHAlv^PFli 

(430) SSGViiPIGTVLFTVSADAPG 'kDSPKLYG VV KA 
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FIGURE 4A (contd.) 



- Section 13 
507 



(469) 469 



.490 



avian infectious bronchitis pol 1ab (322) pi PVEPEGsififlpiDGHiic YVFgjGi3jRFYARpgs*a§^ 

bovine ooronavims pol 1ab (468) Gci||p-||ARN!i^|lpK^^K^^^^ 

Human corona 229E pol 1 ab (405) |]d^6P - ci[s|fi^ASAijKQ LK,^^G^Lg^VKSp|c§s'^^ 

Murine hepatitis poll ab (462) SAf^^Ki-i,CPj^lt^|;j^i|^ 

Consensus (469) 0 VYw's PWSAIwrPII. SSvksYDGL YTCVVGNKAIV 
, ^ Section 14 



avian Infectious bronchitis pol lab 
bovine coronavirus pol 1ab 
Human corona 229E pol lab 
Murine hepatitis pol 1ab 
Consensus 



(608) 508_ 



_i52g_ 



(361) DgHCCER^CLskGlT PEINDGLI LlA!|iYs|iFSVSi|Li^ 
(506) KEijNlMKAi^^i^^lMHj^^Pig^iK^^ 
(443) ViGGTfQ I LA SjgPEKPLH AF Df|FVTij|QT^^F>aCAViTCl 
(500) QBiDJ^^E^^^^K^^^MFigl^R^ 

(508) VT ICRSLYLDYVQHKCGNL QRAILGVSDVWHEQLT 
Section 15 



a>rian infectious bronchtUs poi lab 
bovine corona^Hrus pol lab 
Human corona 229E pol lab 
Murine hepatiUs pol lab 
Consensus 



(547) 547 



J56Q 



■570 



585 



(400) ALi|i^EPFKFLGHKFVY^lSD|i 
(545) IgiGV^Kpi^l^Sy^MiWKPi 
(482)|AaMiiDKl|Ff 
(539)p5iGi>iisLld 
(547) INRKAY LLENVDLFNARRA VS KLPAVCADGLVPLI. 
'■ Section 16 




avian infectious bronchitis poi lab 
bovine coronavirus pol lab 
Human corona 229E pol lab 
Murine hepatitis poi lab 
Consensus 



J6Q0 



010 



624 



(586) 586 , 

(439) FqISa rPiIe'dMw SifelP T E Kp!F E F W^^S^I^RN L EjpFp^T 
(584) iJoiM^P^iilii^JAVl^i^CDYAD^p^^ 

(521) Y Ai^mpG-gfT E I'i^l'iS Rp/^E RjiBiT^V'Ji'^^.fllN NjXiS'Kl. 
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FIGURE 4F (contd.) 
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mouse hepatitis virus 8 (934) 
Consensus (1001) 



IRDIICVQSYNGIKVLPPLLSENQISGYTLAATA 

— Secflon 26 

.1010 .1020 ^0 1040 




(1041) 

human coronavirus OC43 S (1008) 
avianinfecUousbronchiUsvlrusS (797) 
bovine coronavirus 8 (1018) 
mouse hepatitis virus 8 (974) 
Consensus (1041) 



ASLFPPWSAAAGVPFYi,NVQYRINGLGVTMDVLSQNiQKLI 

Section 27 

1041 .1050 .1060 _„ .1070 1080 




(1081) 

human coronavirus OC43 S (1048) 
avianinfecttousbronchitJsvifusS (837) 
bovine coronavirus S (1058) 
mouse her»titis virus S (1014) 
Consensus (1081) 



ANAFNHAL6AIQEGFDATNSALVKIQAVVNAKAEALNNLL 

'■ Section 28 

1081 .1090 .1100 J 110 1120 




(1121) 

human coronavims OC43 S (1088) 
avianinfecffousbronchitisvirusS (877) 
bovine coronavirus 8 (1098) 
mouse hepatitis virus 8 (1054) 
Consensus (1121) 



QQLSNRFGAISASLQEILSRLDALEANAQipRi.INGRLTA 

Section 29 

li?! .1130 J14g 1150 1160 




(1161) 

human coronavirus OC43 S (1128) 
avianinfectiousbronchitisvirusS (91 7) 
bovine coronavirus 8 (1 138) 
mouse hepatitis virus S (1094) 
Consensus (1161) 



LNAYVSQQLSDSTLVKFSAA Q AM EK V N EC V K S Q S S R I N F C 

— Section 30 

- '^VJ9^^ - .1 190 1 200 



GNGKHIISLVQISAPYGLYFIHFSYVPTSPVTAKVSPGLC 
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FIGURE 4F (contd.) 



(1201) 1201 



.1210 



.l1220 



.1230 



- Section 31 
1240 



human coronavims OC43 S (1 168) 1 

avlaninfBCfiousbrondiiflsvtrusS (957) KPAi||sQYAlVP^N6R|| 
bovine coronavtois S (1 178) j^^t>i|ia;- - - - - 

mouse hepatitis virus S (1 134) [|iii»^C^ -aijfjssssrat^^ai!'^ 

Consensus (1201) AG ORG lAPksiSYFVNVNNTWMFTGSGYyyP^EPIT 




- Section 32 



(1241) 

human coronavlrus 0C43 S (1202) 
avianinfectiousbronchitisvirusS (997) 
bovine coronavirus S (1212) 
mouse hepatitis virus S (1 168) 
Consensus (1241) 



(1281) 

human coronavlrus OC43 S (1241) 
avianinfeofiousbronchitisvirusS (1037) 
bovine coronavlrus S (1251) 
mouse hepatifis virus S (1207) 
Consensus (1281) 



1241 ^1250 _ 126 0 ,1270 1280 

: '■ Section 33 

1281 .1290 .1300 .1310 1320 




FKNQTSVAPDLSLDY INVIFLDLQ EMWRIQEAI KVLH 
Section 34 



(1321) 

human coronavirus OC43 S (1279) 
aVpaninfectiousbronchltisvirusS (1 075) 
bovine coronavirus S (1289) 
mouse hepatitis virus S (1247) 
Consensus (1321) 




(1361) 

human coronavirus OC43 S (1319) 
avianinfectfousbronchitlsviaisS (1115) 
bovine coronavirus S (1329) 
mouse hepafiSs virus S (1287) 
Consensus (1361) 




TSCFKRCGGCCDDYTGHQELVIKT 
: S^tion 36 



(1401) 1401 1408 

human coronavirus OC43 S (1350) |[Hf 1 : 

avlanlnfectiousbronchtiisviru5S(1155) QUiRPKKSV SEQ ID NO: 

bovine coronavirus S (1360) SEQ ID NO: 

mouse hepaOtIs virus S (1318) mI s s he d - seq id no : 
Consensus (1401 ) SHOD 



9918 
9909 
9891 
9902 
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FIGURE 5 



human coronaviais OC43 S 
bovine coronavirus S 
mouse hepatitis vims A59 S 
Consensus 



(589) 589 



Section 15 

.620 630 



human coronavinis OD43 S 
bovine coronavinis 5 
mouse hepatibs vinjs A59 s 
Consensus 



human coronavirus OC43 S 
bovine coronavirus S 
mouse hepatitis virus A59 S 
Consensus 



N CTC PQAiS'LGWS KsCLQGDRCNI FANFI LHDVt^^^ 
— ■ — — ^ Section 16 

{631) i^^^A^^^^^^^^NY^^i^ 

— Section 17 

(673) 673 i68Q J690 ,700 714 



(649) griw 

(659)"^:T.:.gVs'i3y^ 



human coronavirus OC43 S 
bovine coronavirus S 
mouse hepatitis virus A59 S 
Consensus 



(673) LLYDSNGNLYGFRDYITNRTFMIRSCYS 6R v¥a1a F H AN S s'eP 

— — Section 18 

(715) 715 .720 730 ^40 756 

(«9">) liiiiiM^S^^SSiMpiii^^^Yi 



(701) i 

(715) ALlFRNIKCNYVFRNSLSRQLQPINYFDSYLGCVViiADNSTA 
g ^ g2 — ^-^SectiOT 19 



liuman coronavirus OC43 S 
ttovine coronavirus S 
mouse hepatitis vims A5d S 
Consensus 
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(733) lifiiiiwe^^ii,.^^ 

(743) S VV Q T G bt:a?Ms"{^X"ff 

(692) K$$P m^%'Rp'0lk§hi4^ _^ 
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human coronavirus OC43 S 
bovine coronavirus S 
mouse hepatitis vims A59 S 
Consensus 
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RRSRRSITTGYRFTNFEPFTV 

Section 20 

jB20 . .830 840 



(775) i 

(734)'S^»i&spsii^M^iiii^^^ 

(799) NSVNDSLEPVGGLYEIQIPSBFTIGNMEEFIQTSSPKVTlbc 

Section 21 

(841) 841 ^50 fiSO ^ 882 



human coronavirus OC43 S 
bovine coronavirus S 
mouse hepatitis x^rus A59 S 
Consensuis 
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FIGURE 6 

FIGURE 6A 
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FIGURE CB 



PCT/US2004/011710 




wo 2004/092360 



PCT/US2004/01 1710 



60/193 
FIGURE 6C 
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FIGURE 7 



FIGURE 7A 



SEQ ID NO: 6053 MFVLLVAYALLH : 12 

SEQ ID NO: 6057 MKKLFVVLVVMPLIYGDNFPCSKLTNRTIGNQWNLIETFLLNYSSRLPPNSDVVLGD 60 

SEQ ID NO: 6061 MRSLIYFWLLLPVLPTLSLPQDVTRCQSTTNFRRFFSKFN ^VQAPAVWLGG 52 

SEQ ID NO: 6065 MFLILLISLPTAFAVIGDLKCTTVS-INDVDTGVPSIS 38 

40 
39 



SEQ ID NO: 6069 
SEQ ID 110:6042 



-MLFVFILFLPSCLGYIGDFRCIQLVNSNGANVSAPSIS- 
-MFIFLLFLTLTSGSDLDRCTTPDDVQAPNYTQHTSSMRG- 



I ID NO: 6072 MLVTPLLLVTLLCALCSAVLYDSSS '■ 27 



SEQ ID NO: 6053 
SEQ ID NO: 6057 
SEQ ID NO: 6061 
SEQ ID NO: 6065 
SEQ ID. NO: 6069 
SEQ ID NO t 6042 
SEQ ID NO: 6072 



YFPWQPWFNCIRNDSNDLYVTLENLKALYWDYATENITWNHRQRIJ^^ 12 0 

YLPSMNSSSWYCGTGIETASGVHGIFLSYIDSGQGFEIGISQEP FDPSGYQLYLH 107 

TDTVDVTNGLGTYYVLDRVYLNTTLLLNG YY 69 

—TETVEVSQGLGTYYVLDRVYLNATLLLTG YY 71 

VYYPDEIFRSDTLYLTQDLFLPFYSNVTGFHTINHTFG ---HP 79 

^ YVYYYQSAFRPPSGWHLQG 46 



SEQ ID NO: 6053 lAGCQTTN GLNTSYSVCNG 31 

SEQ ID NO: 6057 TTRNFNSAEGAIICICKGSPPTTTTESSLTCNWGSECRLNHKFPICPSNSEANCGNMLYG 180 

SEQ ID NO: 6061 KATNGNTNAIARLRICQFPDNKTLGPTVNDVTTGimCLFNKAIPAYMRDGKDIWGITWD 167 

SEQ ID NO: 6065 PTSGSTYRNMALKGTLLLSTLWFKPPFLSDFTNGIFAKVKNTKVIKD6VMYSEFPAITIG 129 

SEQ ID NO: 6069 PVDGSKFRNLALTGTNSVSLSWFQPPYLSQFNDGIFAKVQNLKTSTPSGATAYFPTIVIG 131 

SEQ ID NO: 6042 VIPFKDGIYFAATEKSNVVRGWVFGSTMNNKSQSVIIIHNSTNVVIRACaiPEW 139 

SEQ ID NO : 6072 . GAYAWNISSEFNNAGSSSGCTVGIIHGGRWNASSIAMTAP---- 88 

SEQ ID NO: 6053 

SEQ ID NO: 6057 LQWFADEWAYLHGASYRISFENQWSGTVTFGDMRATTLEVAGTLVDLWWFNPVYDVSYY 240 

SEQ ID NO: 6061 NDRVTVFADKIYHFYLKNDWSRVATRCYNRRSCAMQYVYTPTYYMLN 214 

SEQ ID NO: 6065 STFWTSYSWVQPHTTILGNKLQGFLEISVCQYTMCEYPNT 171 

SEQ ID NO: 6069 — SLFGYTSYTWIEPYN 6VIMASVCQYTICLLPYT 165 

SEQ ID NO: 6042 VS KPMGTQTHTMIPDNAFN — — 158 

SEQ ID NO: 6072 



SEQ ID NO: 6053 CVGYSENVFAVESGGYIPSDFAFNNWFLLTNTSSWDGWRSF 74 

SEQ ID NO: 6057 RVNNKNGTTWSNCTDQCASYVANVFTTQPGGFIPSDFSFNNWFLLTNSSTLVSGKLVTK 300 

SEQ ID NO: 6061 VTSAGEDGIYYEPCTANCTGYAANVFATDSNGHIPEGFSFNNWFLLSNDSTLLHGKWSN 274 

SEQ ID NO: 6065 ICNPN-LGNQRVELWHWDTGWSCLYKRNFTYDVNADYLYFHFYQEGGTFYAYFTDTGW 230 

SEQ ID NO: 6069 DCKPNTNGNKLIGFWHTDVKPPICVLKRNFTLNVNADAFYFHFYQHGGTFYAYYADKPSA 225 

SEQ ID NO: 6042 -CTFEYISDAFSLDVSEKSGNFKHLREFVFKNKDGPLYVYKSZQPIDWRDLPSGFNTLK 217 

SEQ ID NO: 6072 SSGMAWSSSQFCTAHCNFSDTTVFVTHCYKHGG—CPLTGMLQQN 131 

SEQ ID NO: 6053 QPLLLNCLWSVSGLRFTTGFVYFNGTGRGDCKGFSSDVLSDVIRYNLNFEENLRRG— T 131 

SEQ ID NO: 6057 QPLLVNCLWPVPSFEEAASTFCFEGAGFDQCNGAVLNNTVDVIRFNLNFTTNVQSGKGAT 360 

SEQ ID NO: 6061 QPLLVNCLLAIPKIYGLGQFFSFNHTMDGVCNGAAVDRAPEALRFNINDTSVILAEG— S 332 

SEQ ID NO: 6065 TKFLPNVYLGTVLSHYYVMPLTCN SALTLEYWVTPLTSKQYLLAFNQDGVIFNAVD 286 

SEQ ID NO: 6069 TTFLFSVYI6DILTQYYVLPFICNPTA6STFAPRYWVTPLVKRQYLFNFNQKGVITSAVD 285 

SEQ ID NO: 6042 PimPLGINITHFRMLTAFSPAQDIWGTSAAAYFVGYIJamEHIiKyDENGTITDAVD 277 
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SEQ ID NO: 6072 LIRVSMKNGQLFYNLWSVAKYPTFRSFQCVmiTSVYIJIGDLVYTS^^ 



SEQ ID NO: 6053 ILFKTSYG VWFYCTNNT-LVSGDAHIPFGTVLGNFYCFVNTTIGNETTSAFVGAL 186 

SEQ ID NO: 6057 WSLNTTGGWLEISCYTVSDS-SFFSYGEIPFGWDGPRYCYVHY---NGTAL^^ 416 

SEQ ID NO: 6061 IVLHTALG TNLSPVCSNSSDPHLAIFAIPLGATEVPyYCFLKVDTYNSTVYKPLAVL 389 

SEQ ID NO: 6065 CKSDFMSEIKCKTLSIAPSTGVYEMGYWQPIADVYRRIPNLPDCN-IEAWMJKSV^ 345 

SEQ ID NO: 6069 CASSYTSEIKCKTQSMLPSTGVYELSGYTVQPVGWYRRVANLPACN-IEEWLTARSVPS 344 

SEQ ID NO: 6042 CSQNFZJ^LKCSVKSFEIDKGIYQTSNFRWPSGDWR-FPNITNLCFFGEVFNATKFPS 336 

SEQ ID NO: 6072 FKAGGPITYKVMREVKALAYFVNGTAQDVILCDGSPRGLLACQYNTGNFSDGFYPFTNSS 251 

SEQ ID NO: 6053 PKTVREFVISRTGHFYINGYRYFTLGNVEAVNFNVT TAETTDFCTVALASYADVLV 242 

SEQ ID NO: 6057 PPSVKEIAISKWGHFYINGYNFFSTFPIDCISFNLT TGDSDVFWTIAYTSYTEALV 472 

SEQ ID NO: 6061 PPTVREIVITKYGDVYWGFGYIJiLGLLDAWINFTGHGTDDDVSGFWTIASTNFVDALI 449 

SEQ ID NO: 6065 PLNVffil^TFSNaMFNMSSLMSFIQAYSFTCNNIDAA KIY6MCFSSITIDKFAIPN6 401 

SEQ ID NO: 6069 PENWERKTFQNCNFNLSSLLRYVQAESLFCNNIDAS KVYGRCFGSISVDKFAVPRS 400 

SEQ ID HO 1 6042 VyAnEItKRISNCVAD¥SVLYHSTFFSTFKCYaVSA.T KLNDLCFSNVYADSFWRGD 392 

SEQ ID NO: 6072 LVKQKFIVYRENSVNTTCTLHNFIFHNETGANPNPSG--VQNIQTYQTKTAQSGYYNFNF 309 

SEQ ID NO: 6053 NVSQTSIANIIYCNSVINRLRCDQLSFDVPDGFYSTSP— IQSVELPVSIVSLPVYHKHT 300 

SEQ ID NO: 6057 QVENTAITKVTYCNSHVNNIKCSQITANLNNGFYPVSS--SEVGLVNKSVVLLPSFYTHT 530 

SEQ ID NO: 6061 EVQGTSIQRILYCDDPVSQLKCSQVAFDLDDGFYPISSRNLLSHEQPISFVTLPSFNDHS 509 

SEQ ID NO: 6065 RKVDLQLGNLGYI<}SFNYRIDTTATSCQLYYNLPAANVS--VSRFNPSTWNIUl^ 459 

SEQ ID NO: 6069 RQVDLQLGNSGFLQTANYKIDTAATSCQLHYTLPKNNVT— INNHNPSSWNRRYGFNDAG 458 

SEQ ID 110:6042 DVRQIllPGQTGVIADYNYKLFDDFMGCVIAnNTRNin^^ STGNYNyKXKXIiSHG 446 

SEQ ID NO: 6072 SFLSSFVYKESNFMYGSYHPSCKFRLETINNGLWFNSLS VSIAYGPLQGGCKQS 363. 

SEQ ID NO: 6053 FIVLYVDFKPQ-SGGGKCFNCYPAGVNITLANFNET KGPLCVDTSHFTTKYVAVYAN 356 

SEQ ID NO: 6057 IVNITIGLGMKRSGYGQPIASTLS— NITLPMQDHN— TDVYCIRSDQFS-VYVHSTCK 584 

SEQ ID NO: 6061 FVNITVS AAFGGLSSANLVASDTTINGFSS FCVDTRQFTITLFYNVTN 557 

SEQ ID NO: 6065 VFKPQPAGVFTDHDWYAQHCFKASTNFCPCKLDGSLCVGNGPGIDAGYKTSGIGTCPAG 519 

SEQ ID NO: 6069 VFGKN QHDWYAQQCFTVRSSYCPC 483 

SEQ ID NO: 6042 KLRPFER 453 

SEQ ID NO: 6072 VFKGRAT ■—— 370 



SEQ ID NO: 6053 VGRWS ASINTGNCPFSFGKVNNFVKFGSVCFSLKDIPGGCAMPIVA 402 

SEQ ID NO: 6057 SALWDNIFKimCTDVLDATAVIKTGTCPFSFDKLNNYLTFNKFCLSLSPVGANCKPDVAA 644 

SEQ ID NO: 6061 SYGYVS KSQDSNCPFTLQSVNDYLSFSKFCVSTSLLAGACTIDLFG 603 

SEQ ID NO: 6065 TNYLTCHNAAQCDCLCTPDPITSKATGPYKCPQTKYLVGIGEHCSGLAIKSDHCG G 575 

SEQ ID NO: 6069 AQPDIVSPCTT— QTKPKSAFVNVGDHCEGLGVLEDNCGNADPH 525 

SEQ ID NO: 6042 DISNVPFSPDGKPCTPPALNCYWPLND 480 

SEQ ID NO: 6072 CCYAYSYGGPSLCKGVYSGELDHNFECGL 399 

SEQ ID NO: 6053 NWAYSKY YTIG SLYVSWSDGDGITGVPQPVEGVSSFMNVTLDKC 446 

SEQ ID NO: 6057 -RTRTNE QWR SLYVIYEEGDNTVGVPSDNSGVHDLSVLHLDSC 687 

SEQ ID NO: 6061 YPAFGSG VKLT SLYFQFTKGELITGTPKPLEGITD7SFMTLDVC 647 

SEQ ID NO: 6065 NPCTCQP---QAFI1GWSVDSCLQGDRCNIFANFILHDVNSGTTCSTDLQKSNTDIILGVC 632 

SEQ ID NO: 6069 KGCICAN NSFIGWSHDTCLVNDRCQIFANILLNGINSGTTCSTDLQLPNTEWTGIC 582 

SEQ ID NO: 6042 YGPYTOTGIGY QPYRVWLSFELLMAPATVCGPKLSTDLIKNQC 524 

SEQ ID NO: 6072 LVYVTKS GGSRIQTATEPPVITQNNYNNITLNTC 433 
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SEQ ID NO: 6053 TKYNIYDVSGVGVIRVSNDTFLN — -GITYTSTSGNLLGFKDVTK6TIYSITPC 497 

SEQ ID NO: 6057 TDYNIY6RTGVGIIRQTNRTLLS GLYYTSLSGDLLGFKNVSDGVIYSVTPC 738 

SEQ ID NO: 6061 TKYTIYGFKGEGIITLTNSSILA GVYYTSDSGQLLAFKNVTSGAVYSVTPC 698 

SEQ ID NO: 6065 VNYDLYGITGQGIFVEVNATYYNS WQNLLYDSNGNLYGFRDYLTNRTFMIRSC 685 

SEQ ID NO: 6069 VKYDLYGITGQGVFKEVKADYYNS WQTLLYDVNOILNGFRDLTTNKTYTIRSC 635 

SEQ ID 110:6042 VNFHFNGLTGT6VLTPSSKRPQP — PQQFGRDVSDFTDSVRDPKTSEILDISPC 576 

SEQ ID NO: 6072 VDYNIYGRTGQGFITNWDSAVSYNYIJ^AGIAILDTSGSIDIFWQGEYGIJilYYKVNPC 493 



SEQ ID NO: 6053 NPPDQLWYQQA—WGAMLSENFTSYGFSNWELPKFFYASNGTYN 542 

SEQ ID NO: 6057 DVSAQAAVIDGT—IVGAITSINSELLGLTHWTTTPNFYYYSIYNYTNDRTRGTAIDSND 796 

SEQ ID NO: 6061 SFSEQAAYVNDD— IVGVISSLSNS—TFNNTRELPGFFYHSNDGSN 741 

SEQ ID NO: 6065 YSGRVSAAFHAN— SSEPALLFRNIKCNYVFNNTLSRQLQPINYFDSYLGCWNADNSTS 743 

SEQ ID NO: 6069 YSGRVSAAFHKD— APEPALLYRNINCSYVFSNNISREENPLNYFDSYLGCWNADNRTD 693 

SEQ ID NO: 6042- AFG6VSVITPGTNASSEVAVLYQDVNCTDVSTAIHADQLTPAWRIYSTGNNVFQTQAGCL 636 

SEQ ID NO: 6072 EDVNQQFWSGGK-LVGILTSRNETGSQLLENQFYIKITNGTRRFRRSITEN 544 



SEQ ID NO: 6053 -CTDAVLTYSSFGVCADGS IIAVQPRNVSYDSVSAIVTAN LSI 584 

SEQ ID NO:.6057 VDCEPVITYSNIGVCKNGA FVFINVTHSDGD-VQPISTGN VTI 838 

SEQ ID NO: 6061 -CTEPVLVYSNIGVCKSGS IGYVPSQYGQVK-IAPTVTGN ISI 782 

SEQ ID NO: 6065 SWQTCDLTVGSGYCVDYSTKRRSRRSITTGYRFTNFEPFTVNSVNDSLEPVGGLYEIQI 803 

SEQ ID NO: 6069 EALPNCDLRMGAGLCVDYSKSRRADRSVSTGYRLTTFEPYTPMLVNDSVQSVDGLYEMQI 753 . 

SEQ ID NO: 6042 I6AEHVDTSYECDIPIGA6ICASYHTVSLLRSTSQKSIVAYTHSLGADSSIAYSHNTIAI 696 

SEQ ID NO: 6072 VANCPYVSYGKFCIKPDGS lATIVPKQLEQFVAPLFNVTEN ^VLI 588 

. . . * 

SEQ ID NO: 6053 PSNWTTSVQVEYLQITSTPIVVDCSTYVCNGNVRCVELLKQYTSACKTIEDALRNSARLE 644 

SEQ ID NO: 6057 PTNFTISVQVEYIQVYTTPVSIDCSRYVCNGNPRCNKLLTQYVSACQTIEQALAMGARLE 898 

SEQ ID NO: 6061 PTNFSMSIRTEYLQLYNTPVSVDCATYVCNGNSRCKQLLTQYTAACKTIESALQLSARLE 842 

SEQ ID NO: 6065 PSEFTIGNMEEFIQTSSPKVTIDCSAFVCGDYAACKSQLVEYGSFCDNINAILTEVNELL 853 

SEQ ID NO:.6069 PTNFTIGHHEEFIQTRSPKVTIDCAAFVCGDNTACRQQLVEYGSFCVNVNAILNEVNNLL 813 

SEQ ID NO: 6042 PTHFSISlTTEVMPVSMJffiTSVDCNHYICGDSTECANIXLQYGSPCTQLNRALSGIAAEQ 756 

■ SEQ ID NO: 6072 PNSFNLTVTDEYIQTRMDKVQINCLQYVCGSSLDCRKLFQQYGPVCDNILSWNSVGQKE 648 
*. . : . * : ::*::*.. *.::*.*: : 

SEQ ID NO: 6053 SADVSEMLTFDKKAFTLANVSSFGDYN LSSVIPSLPTSGS 684 

SEQ ID NO: 6057 NMEVDSMLFVSENALKLASVEAFNSSETLDPIYKEWPNIGGSWLEGLKYILPSHNS 954 

SEQ ID NO: 6061 SVEVNSMLTISEEALQLATISSFNGDG YNFTNVLGASVYDPASGR— — 887 

SEQ ID NO: 6065 DTTQLQVANSLMNGVTLSTKLKDGVNFN VDDINFSPVLGCLGSDCN 909 

SEQ ID NO: 6069 DNMQLQVASALMQGVTISSRLPDGISGP IDDINFSPLLGCIGSTCAEDGN 863 

SEQ ID NO: 6042 DRNTREVFAQVKQMYKTPTLKYPGGFN - FSQILPDPLKPTK— - 796 

SEQ ID NO: 6072 DMELLNFYSSTKPAGFNTPVLSNVSTG EFNISLLLTNPSSRRK 691 

SEQ ID NO: 6053 — RVAGRSAIEDILFSKLVTSGLGTVDADYKKCTKGLS— lADLACAQYYNGIMVLPGV 739 

SEQ ID NO: 6057 KRKYRSAIEDLLFDKWTSGLGTVDEDYKRCTGGYD— lADLVCAQYYNGIMVLPGV 1009 

SEQ ID NO: 6061 WQKRSVIEDLLFNKVVTN6IjG'PVDEDYKRCSNGRS--VADLVCAQYYSGVMVLPGV 942 

SEQ ID NO: 6065 KVSSRSAIEDLLFSKVKLSDVG-FVEAYNNCTGGAE—IRDLICVQSYNGIKVLPPL 953 

SEQ ID NO: 6069 GPSAIRGRSAIEDLLFDKVKLSDVG-FVEAYNNCTGGQE— VRDLLCVQSFNGIKVLPPV 920 

SEQ ID NO: 6042 RSPIEDLI1FNKVTI1ADA6-FMKQYGECL6DIN--ARDLICAQKFN6LTVLPPL 846 

SEQ ID NO: 6072 RSLIEDLLFTSVESVGLP-TNDAYKNCTAGPLGFFKDLACAREYNGLLVLPPI 743 



BNSOOCID: <WOL__20O40923e0A2J_> 
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SEQ ID NO: 6053 
SEQ ID NO: 6057 
SEQ ID NO: 6061 
SEQ ID NO: 6065 
SEQ ID NO: 6069 
SEQ ID NO: 6042 
SEQ ID NO: 6072 



SEQ ID NO: 6053 
SEQ ID NO: 6057 
SEQ ID NO: 6061 
SEQ ID NO: 6065 
SEQ ID NO: 6069 
SEQ ID NO t 6042 
SEQ ID NO: 6072 



SEQ ID NO: 6053 
SEQ ID NO: 6057 
SEQ ID NO: 6061 
SEQ ID NO: 6 06 5 
SEQ ID NO: 6069 
SEQ ID NO: 6042 
SEQ ID NO: 6072 



SEQ ID NO: 6053 
SEQ ID NO: 6057 
SEQ ID NO: 6061 
SEQ ID NO: 6065 
SEQ ID NO: 6069 
SEQ ID NO: 6042 
SEQ ID NO: 6072 



SEQ ID NO: 6053 
SEQ ID NO: 6057 
SEQ ID NO: 6061 
SEQ ID NO: 6065 
SEQ ID NO: 6069 
SEQ ID NO: 6042 
SEQ ID NO: 6072 



SEQ ID NO: 6053 
SEQ ID NO: 6057 
SEQ ID NO: 6061 
SEQ ID NO: 6065 
SEQ ID NO: 5069 
SEQ ID NO: 6042 
SEQ ID NO: 6072 



ADAERMMIYTGSLIGGIALGGLTS AVSIPFSLAIQARLNYVALOTDVLQENQKILA 795 

MftDKMTMYTASLAGGITLGALGGG AVAIPFAVAVQARLNYVALQTDVLNKNQQILA 1066 

VDAEKLHMYSASLIGGMALGGITA AAALPFSYAVQARLNYLALQTDVLQRNQQLLA 998 

LSENQISGYTLAATSASLFPPWSA AAGVPFYLNVQYRINGIGVTMDVLSQNQKLIA 1019 

LSESQISGYTTGATAAAMFPPWSA AAGVPFSLSVQYRINGL6VTMNVLSENQKMIA 976 

LTDDHIAAYTAALVSGTATAGWTFGAGAALQIPFAHQMAYItfNGIGVTQNVLYENQKQIA 906 
ITAEMQALYTSSLVASMAPGGITA AGAIPFA!PQLQARINHLGITQSLLLKNQEKIA 799 



ASFNKAMTNIVDAFTGVNDAITQTSQALQTVATALNKIQDVVNQQGNSIJraLTSQLRQNF 855 
SAFNQAIGNITQSP6KVNDAIHQTSR6LATVAKALAKVQDWNIQGQALSHLTVQLQNNF 1126 
ESFNSAIGNITSAFESVKEAISQTSKGUm^mALTKVQEVVNSQGSALNQLTVQLQHNF 1058 

NAFNNALGAIQEGFDATN SALVKIQAWNANAEALNNLLQQLSNRF 1065 

SAFNNALGAIQDGFDATN- SALGKIQSWNANAEALNNLLNQLSNRF 1022 

NQPNKAISQIQESLTTTS TALGKLQDWNQNAQALNTLVKQLSSNF 952 

ASFNKAIGHMQEGFRSTS LALQQIQDWSKQSAILTETMASLNKNF 845 



QAISSSIQAIYDRLDTIQADQQVDRLITGRLAAIJWFVSHTLTKYTEVRASRQLAQQKVN 915 
QAISSSISDIYNRLDELSADAQVDRLITGRLTALNAFVSQTLTRQAEVRASRQLAKDKVN 1186 
QAISSSIDDIYSRLDILSADVQVDRLITGRLSALNAFVAQTLTKYTEVQASRKLAQQKVN 1118 
GAISSSLQEILSRLDALEAQAQIDRLINGRLTALNAYVSQQLSDSTLVKFSAAQAMEKVN 1 12 5 
GAISASLQEILTRLEAVEAKAQIDRLINGRLTALNAYISKQLSDSTLIKVSAAQAIEKVN 1082 
GAISSVLNDILSRLDKVEAEVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMS 1012 
GAISSVIQEIYQQFDAIQANAQVDRLITGRLSSLSVLASAKQAEYIRVSQQRELATQKIN 905 



ECVKSQSKRYGFCG-NGTHIFSIVNAAPEGLVFLHTVLLPTQYKDVEAWSGLCVDG 970 

ECVRSQSQRFGFCG-NGTHLFSLANAAPNGMIFFHTVLLPTAYETVTAWPGICASDG-DR 1244 

ECVKSQSQRYGFCGGDGEHIFSLVQAAPQGLLFLHTVLVPGDFVNVLAIAGLCVNG 1174 

ECVKSQSSRINFCG-NGNHIISLVQNAPYGLYFIHFSYVPTKYVTAKVSPGLCIAG 1180 

ECVKSQTTRINFCG-NGNHILSLVQNAPYGLYFIHFSYVPISFTTANVSPGLCISG 1137 

ECVL6QSKRVDFC6-K6YHIiMSFPQAAPH6\nnS'IiHVTYVPSQEIU9FTTAPAICaB6- 1067 

ECVKSQSIRYSFCG-NGRHVLTIPQNAPNGIWIHFSYTPDSFVNVTAIVGFCVKPANAS 964 

*** *. * *** * *:::; ; ** *; *:* * , 

TNGYVLRQPNLALYK EGNYYRITSRIMFEPRIPTMADFVQIENCNVTFVNISRS 1024 

TFGLWKDVQLTLFRN LDDKFYLTPRTMYQPRVATSSDFVQIEGCDVLFVNATVS 1299 

EIALTLREPGLVLFTHELQTYTATEYFVSSRRMFEPRKPTVSDFVQIESCWrYVNLTSD 1234 

DRGIAPKSGYFVNVNN TWMFTGSGYYYPEPITGNNWVMSTCAVNYTKAPDV 1232 

DRGLAPKAGYFVQDDG EWKFTGSSYYYPEPITDKNSVIMSSCAVNYTKAPEV 1189 

-KAYFPREGVFVFNGT SWFITQRMFFSPQIITTDNTFVSGNCDWIGIINNT 1118 

QYAIVPANGR6IFIQVN GSYYITARDMYMPRAITAGDVVTLTSCQANYVSVNKT 1018 



ELQTIVP-EYIDVNKTLQELSYKL-PNYTVPDLVVEQYNQ'PILNLTSEXSTLENKSAELN 1082 
DLPSIIP-DYIDINQTVQDILENFRPNWTVPELTFDIFNATYLNLTGEIDDLEFRSEKLH 1358 
QLPDVIP-DYIDVNKTLDEILASL-PNRTGPSLPLDVFNATYLNLTGEIADLEQRSESLR 1292 

MLNISTP-NLPYFKEELDQWFKNQTSVAPDLSLDY—INVTFLDLQDEMN 1279 

FLNTSIP-NPPDFKEELDKWFKNQTSIAPDLSLDFEKLNVTLLDLTYEMN 1238 

VYDPLQP-ELDSFKEELDKYFKNHTSPDVDLGDISG-INASWNIQKEID 1166 

VITTFVDNDDFDFNDELSKWWNDTKHELPDFDKFN- -YTVPILDIDSEID 1066 
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SEQ ID NO: 6053 YWQKLQTLIDlHNSTLVDLKlflMJRVETYIKWPW^^ 1142 
SEQ ID NO: 6057 NTTVEIAILIDNINNTLVNLEVttJ3RIETYVKWPWYmLI6LWIFCIPLIJ.FC^ 1418 
SEQ ID NO: 6061 NTTEELRSLINNIIOTLVDLEWIMIWYIKWPWWVWLIIVI^ 1352 

SEQ ID NO: 6065 RLQEAIKVLMQSYINLKDIGTYEYYVKWPWYVWLLI6PAGVAMLVLLFFICCCTGC 1335 

SEQ ID NO: 6069 RIQDAIKKLNESYINLKEVGTYEMYVKWPWYVWLLIGLAGVAVCVLLFFICCCTGC 1294 

SEQ ID NO: 6042 RLNEVAIOnjffiSLIDLQELGKYEQyiiaWWYVWLGFIAGLIAIVMTO 1222 

SEQ ID NO: 6072 RIQGVIQGLNDSLIDLEKLSILKTOIKWPWYVWLAIAFATIIFILIl^WVFFMTGC 1122 

.: .:*.:::*::. :*;****.***. . . ** 

SEQ ID NO: 6053 CGFFSCFA SSIRGCCESTKLPYYD-VEKIHIQ— 1173 

SEQ ID NO: 6057 CGCIGCLG SCCHSICSRRQFENYEPIEKVHVH 1450 

SEQ ID NO: 6061 CGCCGCCG ACFSGCCRGPRLQPYEAFEKVHVQ— 1384 

SEQ ID NO: 6065 G~TSCFK KCGGCCDDYTGHQELVIKT SHED- 1364 

SEQ ID NO: 6069 G~SCCFK KCGNCCDEYGGHQDSIVIHNISSHED- 1326 

SEQ ID 190:6042 CSCIiKG2^ SC6-SCCKFDEDDSEPVLKGVKLHYT- 1255 

SEQ ID NO: 6072 CGCCCGCFGIMPLMSKCGKKSSYYTTFDNDWTEQYRPKKSV 1164 

FIGURE 7B 

SEQ ID NO: 6054 MPLIObVDDHA-LVVNVLLWCWLIVILLVCITIIKLIKLCFTCHMFCNRTVY 51 

SEQ ID NO: 6062 MLQLVNDNG-LVVNVILVJLFVLFFLLIISITFVQLVMLCFTCHRLCNSAVY 54 

.SEQ ID NO: 6058. MTFPRALTVIDDNG-MVINIIFWFLLIIILILLSIALLNIIKLCMVCCNLGRTVII 59 

SEQ ID NO: 6045 MYSFVSEET6TLIVNSVLLFIiAFVVFLLVT]JU:LTALRLCa.YCCNIVN^ 52 

SEQ ID NO: 6073 MNLLNKSLEENG-SFLTALYIIVGFLALYLLGRALQAFVQAADACCLFWYTWW 57 

SEQ ID. NO: 6066 MFMADAYFADTVWYVGQIIFIVAICLLVIIVWAFIATFKI^IQLCGMCNTLVL 54- 

SEQ ID NO: 6054 GP IKNVYH-IY-QSYMH IDPF PKRVIDF 77 

SEQ ID NO: 6062 TP IGRLYR-VY-KSYMR- IDPL PSTVIDV- 80 

SEQ ID NO:.6058 VP AQHAYD-AY-KNFMR IKAYN PDGALLA 86 

SEQ ID NO: 6045 KP TVYVYS-RV-KNLNS SEGV PDLLV— 76 

SEQ ID NO: 6073 IPGAKGTAFVYKYTYGRKLMPELEAVIVNEFPKNGWNKKNPANFQDAQRDKLYS 112 

SEQ ID NO: 6066 SP SIYVFN-RG-RQFYEF YNDVKP PVLDVDDV 84 



FIGURE 7C 

SEQ ID NO: 6055 MSNDNC TGDIVTHLKNWNF 19 

SEQ ID NO: 6063 MSNGSIP VDEVIEHLRNWNF 24 

SEQ ID NO: 6059 MKILLILACVIACACGERYCAMKSDTDLSCRNSTASDCESCFNGGDLIWHLANWNF 60 

SEQ ID NO: 6067 MSSVT-TPAPVYTWT ADEAIKFLKEWNF 27 

SEQ ID NO: 6070 — ■. — —MSSTTQAPEPVYQWT ADEAVQFLKEWNF 33 

SEQ ID NO: 6046 MADUGTIT VEELKQLLEQWHL 21 

SEQ ID NO: 6074 MPNETNCTLD FEQSVQLFKEYNL 28 

SEQ ID NO: 6055 GWNVILTIFIVILQFGHYKYSRLFYGLKMLVLWLLWPLVLALSIFDTWANWDSN-WAFVA 78 

SEQ ID NO: 6063 TWNIILTILLVVLQYGHYKYSWLYGVKMAILWILWPLVIJU:,SLFDAWASFQVN-WVFFA 83 

SEQ ID NO: 6059 SWSIILIVFITVLQYGRPQFSWFVYGIKMLIMWLLWPVVLALTIFNAYSEYQVSRYVMFG 120 

SEQ ID NO: 6067 SLGIILLFIWILQF6YTSRSMFVYVIKWILWiMWPLTIILTIFN--CVYALN-NVYLG 84 

SEQ ID NO: 6070 SLGIILLFITIILQFGYTSRSMFIYVVKMIILWI1MWPLTIVI.CIFN--CWAIJI-NVYW 90 

SEQ ID 110:6046 VIGPLFIAWim.LQPAYSHRNRPLYIIKLVFLWLLWPVTLACT'VIA- 78 
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SEQ ID NO: 6074 FITAFLLFLTIILQYGYATRSKViyTLKMIVLWCFWPIiNIAVGVIS--CTYPPN-TGGLV 85 



SEQ ID NO: 6055 
SEQ ID NO: 6063 
SEQ ID NO: 6059 
SEQ ID NO: 6067 
SEQ ID NO: 6070 
SEQ ID NO: 6046 
SEQ ID NO: 6074 



FSFFMAVSTLVMWVMYFANSFRLFRRMTFWAWNPEVNAITVTTVL-GQTYYQPIQQA^ 137 
FSILMACITLMLWIMYFWSIRLWRRTHSVMSFNPETDALLTTSVM-GRQVCIPVLGAra 142 
FSIAGAIVTFVLWIMYFVRSIQLYRRTKSWWSFNPETKAILCVSAL-GRSYVLPLEGVPT 179 
FSIWTmilMmPVNSIRIJIRTGSWWSFNPETNNIjMCIDMK-GRMmPIIEDYH 143 
PSIVFTIVSIVIWIMyPVNSIRLFIRTGSWWSFNPETNNIiMCIDMK-GTVYVRPIIEDYH 149 
IAlAMACIVGLMWLSYFVASFRLFARTRSMWSFNPETNILIiNVPIiR-GTr\^ 137 
AAIILTVFACLSFVGYWIQSIRLFKRCRSWWSFNPESNAVGSILLTNGQQCNFAIESVPS 145 



SEQ ID NO: 6055 GITVTLLSGVLYVDGHRLASGVQVHNLPEYMTVAVPSTTIIYSRVGRSVNSQNSTG~WV 195 

SEQ ID NO: 6063 GVTLTLLSGTLLVEGYKVATGVQVSQLPNFVTVAKATTTIVYGRVGRSVNASSGTG--WA 200 

SEQ ID NO:6059 GVTLTLLSGNLYAEGFKIAGGMNIDNLPKYVMVALPSRTIVYTLVGKKLKASSAT6--WA 237 

SEQ ID NO: 6067 TLTVTIIRGHLYMQGIKLGTGYSLSDLPAYVTVAKVSHLLTYKRG-FLDKIGDTSG— FA 200 

SEQ ID NO: 6070 TLTATIIRGHLYMQGVKLGTGFSLSDLPAYVTVAKVSHLCTYKRA-FLDKVDGVSG— PA 206 

SEQ ID I9Ot6046 VIGAVIIRGHLRHAGHSLGR-CDIKDLPKBITVATS-RTLSYYKLGASQRVGTDSG— FA 193 

SEQ ID NO: 6074 VLSPIIKNGVLYCEGQWLAK-CEPDHLPKDIFVCTPDRRNIYRMVQKYTGDQSGNKKRFA 204 



SEQ ID NO: 6055. FYVRVKHGDFSAVSSPMSNMTENERLLHFF 225 

SEQ ID NO: 6063 FYVRSKHGDYSAVSNPSAVLTDSEKVLHLV 230 

SEQ ID NO: 6059 YYVKSKAGDYSTEAR-TDNLSEQEKLLHMV 266 

SEQ ID NO : 6 0 67 VYVKSKVGNYRLPSTQKGSGLDTALLRNNI 230 

SEQ ID NO: 6070 VYVKSKVGNYRLPSN-KPSGADTALLR~I 233 

SEQ ID NO: 6046 AYNRYRIGIilYKLIITDHAGSNDNIALLVQ-- 221 

SEQ ID NO: 6074 TFVYAKQSVDTGELESVATGGSSLYT 230 



FIGURE 7D 



SEQ ID NO: 6056 MATVKWADASE PQRGRQG 18 

SEQ ID NO: 6064 MASVSP QDRGRK 17 

SEQ ID NO: 6060 MANQGQRVSWGDEST KTRGRSNSRGRKN 31 

SEQ ID NO: 6068 MSFTPGKQSSS-RASSGNRSGNGILK— WADQSDQSRNVQTRGRR-AQPKQTATSQQPS 55 

SEQ ID NO: 6071 MSFVPGQENAGGRSSSVNRAGNGILKKTTWADQTERGPNNQNRGRR-NQPKQTATTQ-PN 58 

SEQ ID 110:6051 MSDNGPQSNQRSAPRITFGGPTDSTDNNQNG6RNGARPKQRRPQGLPN 48 

SEQ ID NO: 6075 MASGKAAGKTDAPAPVIKLGGPKP PKVGSS 35 

* 

SEQ ID NO: 6056 RIPYSLYSPLLVDSEQPW-KVIPRNLVPINKK-DKNKLIGYWN— VQKRFRTRKGK 70 

SEQ ID NO: 6064 RVPLSLYAPLRWNDKPLSKVLANNAVPTNKG-NKDQQIGYWN--EQIRWR101RGE 70 

SEQ ID NO: 6060 NNIPLSFFNPITLQQGSKFWNLCPRDFVPKGIG-NRDQQIGYWN—RQTRYRMVKGQ 85 

SEQ ID NO: 6068 GGNWPYYSWPSGITQFQKGKEFEPAEGQGVPIAPGVPATEAKGYWYRHNRRSFKTADGN 115 

SEQ ID NO: 6071 SGSWPHYSWFSGITQFQKGKEFQFAEGQGVPIANGIPASEQKGYWYRHNRRSPKTPDGQ 118 

SEQ ID NO: 6051 NT ASWFTALTQHGK-EELRFPRGQGVPIOTMSGPDDQIGYYRHATRR-VRGGDGK 101 

SEQ ID NO: 6075 -----6NASWFQAIKAKKI1NTPPPKFEGSGVPDNENIKPSQQHGYWR--RQARFKPGKGG 88 
* : : . ** . **: : • * 

SEQ ID NO: 6056 RVDLSPKLHFYYLGTGPHKDAKFRERVEGWWVAVDGAKTEPTGYGVRRKNSEPEIPHFN 130 

SEQ ID NO: 6064 RIEQPSNWHFYYLGTGPHGDIJIYRTRTEGVFWVAKEGAKTEPTNLGVRKASEKPIIPKFS 130 

SEQ ID NO: 6060 RKELPERWFFYYLGTGPHADAKFKDKLDGVVWVAKDGAMNKPTTLGSRGANNESKALKFD 145 

SEQ ID NO: 6068 QRQLLPRWYFYYLGTGPHAKDQYGTDIDGVYWVASNQADVNTPADILDRDPSSD— EAIP 173 
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SEQ ID NO: 6071 QKQLLPRMFYYLGTGPHAGASYGDSIEGVFWMSQADTNTRSDIVERDPSSH--EAIP 176 

SEQ ID NO: 6051 MKELSPRWYFYYLGTGPEASLPYGMIKE6IVWVaTEGAlJJTPKDHIGrRNPMNN--AAW 159 

SEQ ID NO: 6075 RKPVPDAWYFyYTGTGPAADIiNWGDTQDGIVWVAAKGADTKSRSNQGTRDPDKF--DQYP 146 
^*** **** . .*. _ * 

SEQ ID NO: 6056 QKLPNGVTWEE— PDSRAPSRSQSR SQSRGRGESK 164 

SEQ ID NO : 6064 QQLPSWEIVEPNTPPASRANSRSRSRGNGNNRSRSPSNNRGNNQSRGNSQNRGMQGRG 190 

SEQ ID NO: 6060 6KVPGEFQLEVN QSRDNSRSRSQ SRSRSRNR 176 

SEQ ID NO: 6068 TRFPPGTVLPQGYYIEGS-GRSAPNSR STSRASSRASS 210 

SEQ ID NO: 6071 TRFAPGTVLPQGFYVEGS-GRSAPASR SGSRSQSRGP 212 

SEQ ID 110:6051 LQLPQGTTLPKGPYAEGSRGGSQASSR SSSRSR6NSR 196 

SEQ ID NO: 6075 LRFSDGGPDGNFRWDFIPLNRGRSGRS TAASSAAASR 183 

SEQ ID NO: 6056 PQSRNPSSDRNHN SQDDIMKAVAAALKSLGFDKPQEKDKKS 205 

SEQ ID NO: 6064 ASQNRGGNNNNNNKSRNQSNNWJQSNDRGGVTSRDDLVAAVKDALKSLGIGENPDRHKQ- 249 

SEQ ID NO: 6060 SQSRGRQQFNNKK DDSVEQAVLAALKKLGVDTEKQQQRS- 215 

SEQ ID NO: 6068 AGSRSRANSGNRT PTSGVTPDMADQIASLVLAKLGKDAAKP 251 

SEQ ID NO: 6071 NNRARSSSNQRQ PASTVKPDMAEEIAALVLAKLGKDAGQP 252 

SEQ ID NO: 6051 - NSTPGSSRGNS PARMASGG6ETALALLLLDRLNQLESKV 235 

SEQ ID NO: 6075 APSREGSRGRR SDSGDDLLARAAKIIQDQ 212 

SEQ ID NO: 6056 AKTGTPKPSRNQSPASSQTSAKSLARSQSSETKEQIOJEMQKPRWKRQPNDDVTSNVTQCF 265 

SEQ ID NO: 6064 QQKPKQEKSDNSGKNTPK KNKSRATSKERDLKDIPEWRRIPKG— ENSVAACF 300 

SEQ ID NO: 6060 RSKSKERSNSKTRDTTPK NENKHTWKRTAGK— GDVTRFY 253 

SEQ ID NO: 6068 QQVTKQTAKEIRQKILN KPRQKRSPNK— QCTVQQCF 285 

SEQ ID NO: 6071 KQVTKQSAKEVRQKILN KPRQKRTPNK—QCPVQQCF 287 

SEQ ID NO: 6051 SGKGQQQQGQTVTKKSAAEASK -KPRQKRTATK—QTnHVTQAF 275 

SEQ ID NO: 6075 QKKGSRITKAKADEMAH RRYCKRTIPP—NYRVDQVF 247 

: • : ; * * : 

SEQ ID NO: 6056 GPRDLDH NFGSAGWAN6VKAKGYPQFAELVPSTAAMLFDSHIVSKESG 314 

SEQ ID NO: 6064 GPRGGFK NFGDAEFVEKGVDASGYAQIASLAPNVAALLPGGNVAVRELA— 349 

SEQ ID NO: 6060 GARSSSA— NFGDTDLVANGSSAKHYPQLAECVPSVSSILFGSYWTSKEDG 302 

SEQ ID NO: 6068 GKRGPNQ-— NFGGGEMLKLGTSDPQFPILAELAPTAGAFFFGSRLELAKVQNLSGNLDE 343 

SEQ ID NO: 6071 GKRGPNQ NFGGSEMLKLGTSDPQFPILAELAPTVGAFFFGSKLELVKKN— SGGADE 342 

SEQ ID NO: 6051 GRRGPEQTQGNFGDQDLIRQGTDYKHWPQIAQFAPSASAFFGMSRIGMEVTP 327 

SEQ ID NO: 6075 GPRTKGK-EGNFGDDKMNEEGIKDGRVTAMLNLVPSSHACLFGSRVTPKLQL 298 

* * ***^ * . : , .* • • 

SEQ ID NO: 6056 NTWLTFTTRVTVPKDHP HLGKFLEELNAFTR EMQ 349 

SEQ ID NO: 6064 — DSYEITYNYKMTVPKSDP NVELLVSQVDAFKTGNAK-LQRKKEKKNKRETTLQ 401 

SEQ ID NO: 6060 — DQIEVTFTHKYHLPKDDP KTGQFLQQINAYAR PSEVAKEQR 343 

SEQ ID NO: 6068 PQKDVYELRYNGAIRFDSTLSGFETIMKVLNENLNAYQQQD GTMNMSPKPQRQ— R 397 

SEQ ID NO: 6071 PTKDVYELQYSGAVRFDSTLPGFETIMKVLNENLNAYQKDG GADWSPKPQRKGRR 398 

SEQ ID NO: 6051 — SGTWLTYHGAIKLDDKDPQFKDNVILLNKHIDAYKTFP PTE 368 

SEQ ID NO: 6075 — DGLHLRFEFTTWPCDDPQFDNYVKICDQCVDGVGTRPKDDEPKPKSRSSSRPATRG 355 

SEQ ID NO: 6056 QHPLLNPSALEFNPSQTSP ATAEPVRDEVSIETDIIDEVN 389 

SEQ ID NO: 6064 ' QHEEAIYDDVGAPSDVTHANLEWDTAVDGGDTAVEIINEIFDTGN 446 

SEQ ID NO: 6060 KRKSRSKSAERSEQDWPDALIENYTDVFDDTQVEIIDEVTN 385 

SEQ ID NO: 6068 6QKNGQGENDNISVAAPKSRVQQNKIRELTAEDISLLKKMDEP FTEDTSEI— 448 

SEQ ID NO: 6071 QAQEKKDEVDNVSVAKPKSSVQRNVSRELTPEDRSLLAQILDDGWPDGLEDDSNV 454 
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SEQ ID NO s 6051 -PKKDKKKKPDEAQPLPQRQKKQPTVTLLPAA r- 399 

SEQ ID NO: 6075 NSPAPRQQRPKKEKKIiKKQDDEADKALTSDEERlMAQLEFYDEPK\n:NWGDAALGENEL 414 



FIGURE 7E 



FIGURE 7F 
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FIGURE 7H 
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FtGURES 



avian IBV partial 5'UTR 161- 



HCoV-OCMS 5'UTR (1) - 
(D- 
(1) 



— Section 1 

(1)1 ,10 20 30 42 

(1) TATTAAAATCTTATTGTTGCTGGTATCACTGCTTGTTTTGCC 



bovine CV 5'UTR 
Consensus 



(43) 43_ 



- Section 2 
- 84 



avian IBV partial 5*UTR 161- (43) gtgtctcactttatacatctgttgcttgggctacctagtgtc 
HCOV-OC43 5'UTR (1) - 
bovine CV 5'UTR (1) - 
(43) 



(85) 86_ 



_,90 ,iqo_ 



^ j1 10__ 



- Section 3 
126 



avian IBV partial 6'UTR 161- (85) cagcgtcctacgggcgtcgtggctggttcgagtgcgaggaac 
HCOV-OC43 5'UTR (1) - 
bovine CV 5'UTR (1) - 

Consensus (85) 



Section 4 



(127) 

avian IBV partial 5'UTR 161- (127) 
HCOV-OC43 5'UTR (1) 
bovine CV 5'UTR (1) 
Consensus (127) 



(169) 

avian IBV partial 5'UTR 161- (169) 
HCOV-.OC43 6'UTR (33) 
bovine CV 5'UTR (33) 
Consensus (169) 




(211) 

avian IBV partial 5'UTR 161- (209) 
HCoV-OC43 5'UTR (68) 
bovine CV 5'UTR (68) 
Consensus (211) 



teaj^iG^O^G Tl?>i;AAqjA - - C G G G G^G^C 



WiM pT'6A$'d'.^fO^>GJi;^f^^dS'^ 

T-rCA CTGATCTCf TGTTAGATCTTTTCGTAATCTA 

Section 6 

211 220 ,i230 ^ ^ 240 ^52 

, - A T C T.AffiG g|t {Pos-rrGQ 

AACTTTATAAAAACATCCACTCCCTGTAGTCTATGCCTGTGG 

Section 7 



(253) 

avian IBV partial 5'UTR 161- (261) 
HCoV-OC43 5'UTR (1 10) 
bovine CV 5'UTR (110) 
Consensus (253) 



253 .^J260^. 270 280 294 

& CT A C G"TrT,;C-T C G C A^'^i A'A GG T C G Gjt T^f fllc G A'CG.T T X GT AGIg G 
jGCGT AGA^'i'TTT TCA^IAGTGGTGTTgrA,!' A[r T-CAT TTCT -GCT 
gpG ^A^A'Ti'T TT T C7i.T AG T G G T GT C.T AT T - 'c'aT T "JT-C T -G C T 
GCGT AGAT T T T TCATAG T GGTGT CTATAT T CAT T TC T GC T 

_ _ Section 8 

(295) 295.._^0p. .310 .320 336 

avian IBV partial 5'UTR 161- (293) PGi^GTGCpAAAC^Afei^ficc'T 

HCoV-OC43 5'UTR (150) feTTAm CAGCTT'T^^^ 

bovine CV 5'UTR (150) ||qc^^(?i&i^fc^rfs^<^jgg^ 

Consensus (295) G T T AAC AGO T T T C AG C CAG G G AC GT G T T G T AT C C T AG G C 



(337) 337 350 ^60 373 

SEQ ID NO: 9910 • (335) TTT!AG.T"<^AiGCiAGAG^^ 

SEQ ID NO: 9919 : (189) i?VGTG|-5&e;c!iCGievG^ 

SEQ ID NO: 9892 : (189) i?!5JrG;-|GtteCAicid;ba^^ 

consensus (337) agtg' gcccacccataggtcacaatg 
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SEQ ID NO: 

'^■^^^i^^A-f^Gf GTG (136.154nt) 6021 

F2: GTG^GTG^AT^C|CTTCA (152-172 nt) 6022 

F3: CTTCACf G jTCTfn-GT^A (168-195nt) 6023 

'H"^^'^^^^-^^^^ (307^29nt) 6024 

R2:AAA^G^TATA||c^C^TATG (265-288nt) 6025 

''3:C^^TATG^fAfc-^GCCCA (250-274nt) 6026 
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FKGURE 10 



Section 1 

(1)1 ,10 ^20 36 

avian IBV3'UTR(NC_001451) 27103- (1) GTAACATAATGGACCTGTTGTTTCCTGGTACATTTT 

HCOV-OC43 3'UTR partial (1) 

bovine CVS'UTR (1) 

Consensus (1) • ^ _ 

■ _ Section 2 

(37) 37 §0 §0 72 

avian IBV 3'UTR (NC_001451) 27103- (37) GTTAAACACTATTTCTGTGCTTTCCTATCAATTATT 

HCOV-OC43 3'UTR partial (1) 

bovine CV 3'UTR (1) 

Consensus (37) 

Section 3 

(73) 73 ,80 go 108 

avian IBV 3'UTR (NC_001451) 27103- (73) ACAGGCATTGATTGTGATTATGTTCAATACTTAAGC 

HCoV-OC43 3'UTR partial (1) 

bovine CV 3'UTR (1) ^ 

Consensus (73) 

Section 4 

(109) 109 _ J2g^ ^30 _ 144 

avian IBV 3'UTR (NC_001451) 27103- (109) TTCTTTTGGTTGCTTTTTGCTTATTGTATTGTTGCT 

HCOV-OC43 3'UTR partial (1) 

bovine CVS'UTR (1) 

Consensus (109) 

- Section 5 

(145) 145 ^150 160.^ .1"^°^ . . . 1§0 

avian IBV 3'UTR (NC_001451) 27103- (145) GTGCTTTTTATTGTTGTGATTCTciiTT^TT^^ 

HCOV-OC43 3'UTR partial (1) TT^k'G'^lflMssM 

bovine CV 3'UTR (1) - Gp^At^AA^ 

Consensus (145) A GAGAATGAAC 

Section 6 

(181) 181_ .190 gpq 216 

avian IBV 3'UTR (NCJ001451) 27103- (179) T|TJ^¥,cle^q3;Aj|AA§TTCAApA!ii;ASGAGTlAA^ 
HCoV^C43 3'UTR partial (14) t^i^-|c|l9^g|ijCCX^ 
bovine CV 3'UTR (1 1) ^|f||^ij-^(:^^c:iA|^ 

Consensus (181) 'CT TAT gtcggcacctggtggtaacccctc gcagg 

— - - — - ~- - —Section 7 

(217)217 230_ J240 ^ _252 

avian IBV 3'UTR (NC 001451) 27103- (215) ^TpGjglATiTAXSCTTG^TlicCTS^fgGi^^ 

HCoV<OC43 3'UTR partial (48) 'pA^^MsM iS^jEaffl^^Sfe!^ 

bovine CV 3*UTR (45) W^'^P''^ ^l^lfei^^fe^i^j^^^'G 

Consensus (217) AAAGTCGGG ATAAGGCAC TCTCTATCAG 
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FIGURE 10 (contd.) 



(253) 253 

avian IBV3'UTR (NC_001451) 27103- (251) cMgW: 

HCoV-OC43 3'UTR partial ^ " 

bovine CV 3'UtR ^^^^ 
Consensus (253) AATGGA 



^0 



- Section 8 



'251) CjGp^A^^p^AATC T G^CjE^^CpTAG^A^SCC TgI 

(73) l@2Tag^--g^Ga?!T;GC^g?^Clp^ fe 

'253^ AATfif=a rpf-^'r^-.r,„,r~A^-^^,;;t^-^^^ » 



TGTCTTGCTGCTATAATAGATAGA G 

— — Section 9 

.30 0 310 324 



avian IBV 3'UTR (NC_001451) 27103- 

HCOV-OC43 3'UTR partial ^lu/; wm^-^Ml^m{W:m^!^M:>m^i~^&mM 

bovineCV 3'UTR (10 J ii6«#^iGfe^|^-^^---P^|T5>i^^ 
Consensus (289) AAGGTTATAGCAGACTAT AGATT AAT'ifAGTTG 

~~ ~ ■ — ■ — Section 10 

(325) 325 ,330 340 350 -^Rn 

avian IBV (NC^MSI^ayiOS- (323) pTT^lAi^Tf A^TTA^ 

Consensus (325) AAAGfTTTGTGTGGTAATGT^ 

~" —Section 11 



(361) 361 

avian IBV 3'UTR (NC_00i451) 27103- (358) " 

HCOV-OC43 3'UTR partial (175) f^- 
bovine CV 3'UTR (1 72) 



----- 1 — / KSfsasSiaS;?'' 
Consensus (361) TG AAAGACT 




(397) 397 

avian IBV 3*UTR (NC_001451) 27103- (393) GAT^G^Gfei 
HCOV-OC43 3'UTR partial (206) ^t'^'&C:laA| 

bovineCV 3'UTR (203) irofie^^^^p^PE^iTS- 
Consensus (397) TGCCCAAGGGGAAGAGCCAGCACG 



T GO G G AAG TAA-T T GCC GACAAis 
Section 12 



(433) 433 



TTAAGT-TA 
— Section 13 
468 



avian IBV 3'UTR (NCL001451) 27103- (429) a^G^Tig^T^^igg^tj^--'^ 
i\ (238) p;5?iicEiiaGa^A^^'feft:^,!l5 



HCOV-OC43 3'UTR partial (koo; ^-m^^-smMS^-AQia^-PiS^, 

bovineCV 3'UTR g35) ^iTi^Sg^^^^i^^j^j^^^i^^ 

Consen sus (433) CCA CCAGTAATTAGTAAATGAATGAA^AATTAT 

- — Section 14 

,480 .490 504 



(469) 469 480 490 1504 

HCOV-OC43 3 UTR partial (274) GGGGAASia^GG^GAATC Aqi 

bovineCV 3'UTR (271) M^^J?ie"<^^>jT3c^- 



(505) 505_ 513 

avian IBV 3'UTR (NC_001451) 27103- (498) GTTAGAGCa " SEQ ID NO: 9911 

HC0V-OC43 3'UTR partial (293) SEQ ID NO: 9920 

bovine CV 3'UTR (290) SEQ ID NO: 9893 

Consensus (505) 
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PGT/US2004/011710 



F-1 tctatc-|§|a|ggatgtct 



(245 ~ 265 nt) 



SEQ ID NO: 
6027 



F-2 TTAGHT J AA-^TTT ~ GT J ^ f (3^8" 339 "*) 



6028 



F-3 T AGTGTT^ GAG ^ A ^ GT~ TAAAG A ( 346 ~ 368 nt) 



R-1 A^ irf GCCATA ^ AACTT 

C A T A 



(458 ~ 476 nt) 



6029 
6030 



R.2ACTAA^TfGfrfT^cXTAA (426 -448nt) 



"-^ ^ "S^^l^^^-S^^f ^^'^ < ~ 395 nt) 



6031 

6032 
6033 



FIGURE 12 
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using Miral nembrane- fusion -table for pep4 




1a 



1a 



la 



~7r 



la 



FiGURE 13 



1b 



lb 



EM N 



h: 



EM N 



lb 



M N 



lb 



E M 



HH^SZZh 



// S> EM N 
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FIGURE 14 




EM N 



h5 — 



E M 

l-O-dl — \sm 



FIGURE IS 





SEQ ID NO: 6569 

1 sgf rkmaf ps gkvegcmvqv fe^Vtttlngl wlddtvycpr [^ictaedml npnyecailir 

61 fcsnlisf Ivqa gnvqlrvigh smgincllrlk vdtsnpfctpk ykfvriqpgq tfsvlacyng 

121 spsgvyqcam r pnht: ifcg j£|l n.ggcbsvg£ nldydcvsfc fyjipljmielptg vhagbdlegk 

181 fygpfvdrqt aqaagtdttx'k;Uivlatsrlya avingdnir£Ji>m?Etttlndf nlvaulcynye 

241 pltqdhvdil gplsaqtgia vl^ti«^alke llqngmng^yilgstilede ftpfdwrqc 

301 sgvtfq 



Key residues of the substrate site 
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FIGURE i6 



■ : (1) 

avian 88V nsp2 (1) 

iVIHV nsp2 (1) 

SARSnsp2 (1) 

BCoVnsp2 (1) 

Con^nsus (1) 



20 



30 



- Section 1 
51 



VICSA§DMifN 



<62) 

avian IBV nsp2 (49) 
IWHVnsp2 (52) 
. SARS!ilSp2 (52) 
BCoVmspa (52) 
Consensus <52) 



(103) 

av3an1BV.nsp2<100i 
WHVmp2<'E02) 
SARS nsp2 <102) 
BC6ynsp2<102) 
Consensus (103) 



SCgVKMVSeisK^'EFCIVSVTyGNIITLOGLlfLiQmCV^^^ 
SGFRKmPplGKVEGCi^^QVTCGTTTLBIGLWLTDDlVYCeP H . 

SGIVKMVSPS SKVEPC I VSVT YGin5TLKGl.WE.bDTVYC PRHV I C SAADHt-I^ 

— — Section 2 

§z gig 20 .80 50 tog 

pp.'arp^LCRVT^iprc -i^ i.'svm'imQ^CQmhW'Ti.Qm pmi p k v 

^N,5FE 1? L L I RKpWH S FX VQiij|Er - VQ IeR"/8? iSMaRcSCiR L K VDT S N P FI^t Y 
FDY WlitiCRVTSSDP VLSGR VSLTVMSYQMQGCLLVLTVTLQKPitTPKy 

— — ■■ — — — '■ — ■■ ' — Section 3 

103 .110 .120 130 




(1S4) 

aylanfBVnsp2(1S1) 
MHVr»sp2(153) 
SARS nsp2 (103) 
.8CpVnsp2(153> 
Consensus (154) 



........ ^F^C^fii 'f Q^WqtgPtf ^^rRSfWG^^JiPspJUAC^ 

KFGVVK PGETPWLAA YIJGSfiQGA FH VTIiRS S HTIKGSFLCG SCGS VG F V I 

' — ' — ■■ \ — r— — ^ SecUon 4 • 

154 .160^ .170 ' "^QQ 204 

vie- Vi4^f I^^E rft*4e5J*51?'DFS GilF ^ 
iX.)?'<?^-^'SFC|?|frl|-MSE.P*SVHAQT^ 

DGDCVKFVYMHCJLBI.Sa'GCHTGf'DIii GDPYGPYVDAQVVQLPVQDYTQTVM 



(205) 

awlan lBVnsp2(202) 
MHVnsp2 (204) 
SARS nsp2 (204) 
BCoV nsp2 (204) 
Consensus (205) 



2 05 210 220 
gVAWLYAA'^j^iSV.KESSPSLPKl 



.230 



-SecQonS 



240 



.\|iAwt,^/?^g]cis?obR W:pi 



(250) 

avian IBVnsp2{251) 
MHV nsp2 (247) 
SARS nsp2 (249) 
BCoV nsp2 (247) 
Consensus (256) 



mj:mi^^P.J^^im .-t5?flQS£>K4sfeB*'WV5?^fiMG^^ 

WAWLYAAIIN CN WFLQSDTCSIiEDFHW/AMSNGFSPXKSt) L 

SecSon 6 



256 



27Q 



280 



290 



(307) 

avfan lBVftsp2(301) 

MHVnsp2(297) 
SARS nsp2(300) 
BCoV nsp2 (297) 
Consensus (307) 



^i'^KLiS-X'GVDVCKLIiRTiMVKNfS-QlGaDPJ.IiGgYN^ 
VlL D Al1^|M l-Oy^.VEQV;L A A I K R.l.H:S - <3.FQG^^ 
ipl|GP L^A'Q ro^flAV LD^CAAlSCEXiLQNQEjJsiCCRTXliGSTi^ 
V^]DA1.M^H V^ETL LA AIKRL.KI<I " G FGG RQliGS CS P EDEiiT PSDY 
VIDALAAMTGVSVE LLAAIKR£i S GFQGRQlIiGSCILEDEl.TPSDVYQQ 
: : '• — Section 7 




SEQ U) HO: 6570 

SEQ ID HO: 6571 

SEQ ID HO: 6569 

SEQ IP BTO: 6572 

SEQ ID HO: 6573 
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FIGURE 17 
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FIGURE 18 




BNSDOCID: <WO___20040923e0A2_l_> 
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FIGURE 19 
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FUNCTION 



Receptor 
binding 



STRUCTURE 




Otigomenzation % 
dom^n Leucine 
^ zipper 



Oligomerizatlon 
dommn 




Leucine 
zipper 



Plasma 
membrane 



domain 



m Cystelne-rlcii domain 
> Cytopfasmic tail 



S1 

(globular domain) 



S2 

(stalk domain) 
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FIGURE 20 



Group 2 




HMSDCKID: <WO__200409Z3eOA2_l_> 
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FIGURE 21 A 

G1 cons 



100 >^ 

SARS 



.G2 cons 



FIGURE 21B 



HlillllllHillHIilillili^m 



i^^iiieBaiiiiiii 



AIBV 
G1 cons 

SARS 
G2 cons 



FIGURE 22 



colled-coil membrane anchor 




122 143 181 



I I J I 

269 287 331 351 405 aa 
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FIGURE 23 

LPRKSQPTS I SCRSVIi-TNFKICVAVARLHA-CTYAV-TI INFTVVDKKRVTRPSSADCL 
RFRPCC SRS S AYLGFVRV- PKGKMESLVLGVNEKTHVQLSLPVLQVRDVLVRGFGDSVEE 
ALSEAREHLKNGTCGIiVELEKGVriPQLEQPYVFIKRSDALSTNHGHKVVEL.VAEMDGIQY 
GRSGITLGVLVPHVGETPIAYKNVLLRKNGNKGAGGHSYGIDLKSYDLGDEtiGTDPIEDY 
EQKnAnSTTKHGSGALRELTRELNGGAVTRYVDNNFCGPIXSYPLDCIK^ 

QLDYIESKRGVYCCRDHEHEIAWFTERSDKSYEHQTPFEIKSAKKFDTFKGECPKFVFPL 

NSKVKVIQPRVEKIOCTEGFMGRIRSWPVASPQECNlSnyiHLSTLMKCNHCDEVSW 

KATCEHCGTENLVIEGPTTCGYLPTNAWKMPCPACQDPEIGPEHSVADYHNHSNIETRL, 

rkggrtrcfggcwayvgcynkraywprasadigsghtgitgditv:etlnedlleilsre 

RVNINIVGDFHLNEEVAIILASFSASTSAFIDTIKSLDYKSFKTIVESCGNYKVTKGKPV 

kgawnigqqrsvltplcgfpsqaagvirsifartldaanhsipdlqraavtildgiseqs 

DRLVDA]WYTSDDLTNSVIIMAYVTGGLVQQTSQWLSIILLGTTVEKLRPIFEWIEAK^ 

gveflkdaweilkflitgwdivkgqiqvasdnikdcvkcfidvvnkalemcidqvtiag 

AKLRSIiNLGEVFIAQSKGLYRQCIRGKEQLQLLMPLKAPKEVTFLEGDSHDTVLTSEEW 

LKNGELEALETPVDSFTNGAIVGTPVCVNGLMLLEIKDKEQYCALSPGIiLATNNVFRLKG 

GAPIKGVTFGEDTVWEVQGYKNVRITFELDERVDKVLNEKCSWTVESGTEVTEFACW^ 

EAVWTIiQPVSDLLTNMGIDLDEWSVATFYLFDDAGEENFSSRMYCSFYPPDEEEEDDAE 

CEEEEIDETCEHEYGTEDDYQGLPLEFGASAETVRVEEEEEEDWLDDTTEQSEIEPEPEP 

TPEEPWQFTGYLKLTDWAIKCVDIVKEAQSANPMVIWAANIHDKHGGGVAGALNKAT 

NGAMQKESDDYIKLNGPLWGGSCLLSGHNIJ^CLHWGPNLNAGEDIQLLKAAYENFN 

SQDILLAPLLSAGIFGAKPLQSLQVCVQTVRTQVYIAVlSnDKALYEQVVMDYLDNL 

APKQEEPPNTEDSKTEEKS WQKPVDVKPKIKAC IDEVTTTLEETKFLTNKLLLF ADING 

KLYHDSQNMLRGEDMSFLEKDAPYMVGDVITSGDITCWIPSKKAGGTTEMLSRALKKVP 

VDEYITTYPGQGCAGYTLEEAKTALKKCKSAFYVLPSEAPNAKEEILGTVSWNLREMLAH 

AEETRKLMPICMDVRAIMATIQRKYKGIKIQEGIVDYGVRFFFYTSKEPVAS I ITKLNSL 

IsTEPLVTMPIGYVTHGFNLEEAARCMRSLiKAPAWSVSSPDAVTTYNGYLTSSSKTSEEHF 

VETVSI^GSYRDWSYSGQRTELGVEFLKRGDKIVYHTLESPVEFHLDGEVLSLDKLKSLL. 

SLREVKTIKAn^TTVDNTNLHTQLVDMSMTYGQQFGPTYLDGADVTKIKPHV]^^ 

LPSDDTLRSEAFEYYHTLDESFLGRYMSALNHTKKWTKFPQVGGLTSIKWADNNCYLSSV^ 

LiALQQLEVKFNAPALQEAYYRARAGDAAISnFCALILAYSNKTVGELGDVRETMTHL 

LES AKRVIiNWCKHCGQKTTTLTGVEAVMYMGTIiSYDNLiKTGVS I PCVCGRDATQYLVQQ 

ESSFVMMSAPPAEYKLQQGTFLCANEYTGNYQCGHYTHITAKETLYRIDGAHLTKMSEYK 

GPVTDVFYKETSYTTTIKPVSYKLDGVTYTEIEPKLDGYYKKDNAYYTEQPIDLVPTQPL 

PNASFDNFKIiTCSNTKFADDLNQMTGFTKPASRELSVTFFPDLNGDWAIDYRHYSASFK 

KGAKLLHKPIVWHINQATTKTTFKPNTWCLRCLWSTKPVDTSNSFEVLAVEDTQGMDNLA 

CESQQPTSEEWENPTIQKEVIECDVKTTEWGNVILKPSDEGVKVTQELGHEDLMAAYV 

ENTSITIKKPNELSLADGLKTIATHGIAAINSVPWSKILAYVKPFLGQAAITTSNCAKRL 

AQRVFNNYMPYVFTLLFQLCTFTKSTNSRIRASLPTTIAKNSVKSVAKLCLDAGINYVKS 

PKF-SKLFTIAMWLLLLSICLGSLICVTAAFGVLLSNFGAPSYCNGVRELYLNSSIWTTMD 

FCEGSFPCSICLSGLDSLDSYPALETIQVTISSYKLDLTILGIiAAEWVIiAYMLFTKFFYL 

LGLSAIMQVFFGYFASHFI SNSWLMWFI I SIVQMAPVSAMVRMYIFFASFYYIWKSYVHI 

MDGCTSSTCMMCYKRNRATRVECTTIWGMKRSFYVYANGGRGFCKTHNW^ 

GSTFISDEVARDLSLQFKRPINPTDQSSYIVDSVAVKNGALHLYFDKAGQKTYERHPLSH 

PVNLDNLRANNTKGSLiPIWIVFDGKSKCDESASKSASWYSQLMCQPILLL^ 

GDSTEVSVKMFDAYVDTFSATFSVPMEKLKALVATAHSELAKGVALDGVLSTFVSAARQG 

VVDTDVDTKDVIECLKLSHHSDLEVTGDSCNNFl^TYNKVENMTPRDLGACIDCNARHIN 

AQVAKSHWSLIW3WKDYMSLSEQLRKQIRSAAKKNNIPFRLTCATTRQVVWITTKISL 

KGGKIVSTCFKLiMLKATLLCVLm^LVCYIVMPVHTLSIHDGYTNEIIGYKAIQDGVTRDI 

I STDDCFANKHAGFDAWFSQRGGS YKNDKSCPWAAI ITREIGFIVPGLPGTVLRAINGD 

FLHFLPRVFSAVGNICYTPSKL.IEYSDFATSACVDAAECTIFKDAMGKPVPYCYDTNLLE 

GS I S YSELRPDTRYVLMDGSI IQPPNTYLEGSVRWTTFDAEYCRHGTCERSEVGICLST 

SGRWLNNEHYRALSGWCGVDAMKn^IANIFTPLVQPVGAIiDVSASWAGGIIAILW 

AYYFMKFRRWGEYNHWAANALLFLMSFTILCLVPAYSFLPGWSWYLYLTFYFTlSroV 

SFXJ^LQWFAMFSPIVPFWITAIYVFCISLKHCHWFFNNYLRKRVMFNGWFSTFEEi^ 
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CTFLLNKEMYLKLRSETLLiPLTQYNRYLALiYNKYKYFSGALDTTSYREAACCHLAK^ 

FSNSGiUDVIjYQPPQTSITSAVLQSGFRKMAFPSGKVEGCWQWCGTTTLNGLWLDDTVY 

CPRHVICTAEDMLNPNYEDLLIRKSlOTSFrjVQAGlWQIjRVIGHSMQNCLiljRLKVDTSNPK 

TPKYKFVRIQPGQTFSVLACYNGSPSGVYQCAMRPNHTIKGSFLNGSCGSVGFNIDYDCV 

SFCYMHHMELPTGVHAGTDLEGKFYGPFVDRQTAQAAGTDTTITLIWIiAWLYAAVINGDR 

WFIiNRFTTTLNDFNLVAMKYlSrYEPIiTQD 

GRTILGSTILEDEFTPFDVVRQCSGWFQGKFKKIVKGTHHWMLLTFLTSrililliVQSTQW 

SLFFFWENAFLPFTLGIMAIAACAMLLVKHKHAFLCLFIiLPSLAWAYFNM^^ 

MRIMTWLELADTSLSGYRLKDCVMYASALVLLimTARTVYDDAARRVWTLiy^ 

VYYGNALDQAI SMWALVI SVTSNYSGWTTIMFIiARAIVFVC^rEYYPLLF ITGNTLQC IM 

LWCFLGYCCCCYFGLFCLLJ^YFRLTLGWDYLVSTQEFRYMNSQGLLPPKSSIDAFK^ 

NIKLLGIGGKPCIKVAWQSKMSDVKCTSVVLLSVLQQLRVESSSKLWAQCVQLHNDILL 

AKDTTEAFEKlWSLLSVIjLSMQGAVDINRIjCEE]mJDNRATL.QAIASEFSSLPSYAAYA^ 

QEAYEQAVAWGDSEWLKKLKKSIJWAKSEFDRDAAMQRKIiEKMADQAMTQM^ 

KRAKVTSAMQTJ^FTMLRIOjDNDAIjNNI innardgcvplni iplttaaklmvwpdygty 

KNTCDGNTFTYASALjWE I QQWDADSKIVQIiSE INMDNS PNLAWPLIVTAIiRANSAVKIiQ 

ISnsrELSPVALRQMSCAAGTTQTACTDDNALAYYTSlNSKGGRFVIjALL 

GTGTIYTELEPPCRFVTDTPKGPKVKYLYFIKGLNNIjNRGMVTjGSIxAATVRIjQA 

PANSTVL S FCAFAVDPAKAYKDYLASGGQP I TNCTVKMIjCTHTGTGQAI TVTPEANMDQES 

FGGASCCLYCRCHIDHPNPKGFCDLKGKYVQIPTTCANDPVGFTLRim/CTVCGMWKGYG 

C S CDQLRE PLMQS ADASTFIiNGF AV- VQPVLHRAAQALVLMS STGLL. I FTTKKIjLVLQS S 

-KIjIAVASRRRMRKAIY-TLTL-LRGILCLTTNMKRLFITWLKIVQRLLSMTFSSLE-MV 

TWYHIYHVSV-IiNTQWLI-SMLYVILMRVIVIH-KKYSSHTIAVMMIISIRRIGMTS-RI 

liTSYAYMLT-VSWAJmY-RIiYNSAMLCVMQAIi-AY-H- 1 IRILMGTGTI SVI S YK-HQA 

AEFLL-WIHITHC-CPSSIi-LGHWI.LSPIVMLISQNHLI.SGIC-NMI 

IGTRHTIPIVLTVWMIGVSFIVQTLMCYFLIiCFHLQVLDH EKYL-MVELLLFQIiDTIF 

VS -ESYI IRM-TYIARVSVSKKTF-CMIiLIQLCMQLLAI YC- INALHAFQ-LH-QTMLLFK 

LSNPVIIilKTFMTLLCLKVSLRKEVLLN-NTSSLIjRMATLIjSVIMTIIVIICQQCVISDN 

SYS -LKLLINTLIVTMVAVTjMPTK- SLiTIWINQIiVSHLINGVRLDFIMTQ-VMRIKMHFS 

RILSVMSSLL.-LK-ILSMPLVQRIELAP-LVSIiSVVL-QIDSFIR]Snf-SQ-PPrjEELLW- 

LEQASFTVAGIIC-KLFTVM-KLHTIjWGIIQNVTEPCLTCLG-WPLLFLIiANITIiAVTY 

HTVSTG-DTSVRKY-VRWSCVAAHYMLNQVEHHPVMLQLLiMLIVSIiTF^ 

QLMVI R- LT SMS AI YNTG SMS VS IE IGML IMNS WMSFTLTCVNI S P — FFLMMPLCAITV 

TMRLKY — LALRTLRQFF 1 1 KIMCSCrjRQNVGLRLTLLiKDriTNF AHS I QC - LNKEMI TCT 

CliTQIHQEY-AQAVIiSMILSKQMVHL-LKGSCHWLLMLTHIiQNILIRSMLMSFTCIYNTL. 

ESYMMSLIATCWTCIP-C-LMITPHGTGNIiSPMRIiCTHHIQSGRL-VIjVYCAIHRIiHFVA 

VPVLGDHSYVASAAMTMSFQHHTN-CCLLiIPiyiFAMPQVVMSIjM-HNCI-EV-AIIASHIS 

L PliVFHYVLMVRFLWTKTHV- AWMSLTSMR- QHVI GLMIAI TYL.PTLVLRDS SF SQQK 

RSKPLRKHL SCHM\^PL YAKYS LTENC I FHGRLENLDHH - TETMSLiLVTV- LKI VKYRLE 

STPLKKVTMVMLLCTEVLRHTS-MLVITLC-HLTIi-CHLVHLL-CHKSTM-ELLACTQHS 

TSQMSFLAMLQI IKRSACKSTLHSKDHLVLVRVILPSDLLS ITHLLA-CIRHALMQLDMP 

YVKRH -NI C P - INVVE S YLRVRA- SVL INSK- 1 QH -NSMF S AL -MHCQKQLLTLj - SLMKS 

LWTLLIMT-VLSMLDFVQNTTSILAILLNYQPPAHC-LKAH-NQNILIQCADL-KQ-VQTC 

SLELVAWLLKLLTL-VL-FMTI S-KHTRI SQLNASKCSTKVTiLHMMFHLQSTDIiK-AL- 

ElSTFLHAILIjGEKLFIjSHIjIIHRTIi-LQKS-DCIiRRLiLIHHRVLNJ^ 

MSTASMWIiSQGQKLAFCA-CLIEIFMTNCNLQV-KYHVAMWLHYKQKM-LDFLRTVVRSL 
LVF ILHRHLHT SAIil - S S RLKDYVLT YQAYQRT - PTVDS S li - WVSK - 1 TKSMVTL. I CLiS P 
AKKLFVTFVRGLALM- RAVMQLEMLWVTjT YLS S -DFLQVLT — LYRLVMLTLKI TQNS PE 
LMQNIiHQVTSL.NILYHSCIKACPGM-CVLR-YKCSVIH-KDCQTESCSSFGRMALiSIiHQ- 
STL SRLDLKERWCVTNVQLAFLiIjHQ I LMPAG I ILiWVTiTMS I THL - LMFS SGALRVTFRV 
TMTNIARYMEMHMWHLVVMLS-LDV-QSMSAIiLSALIGLLNTLL-EMN-GLILIJ^K^^ 
L - SLHCLL I SFQFFMTLEIQRLS SVCLRLK-NGS STMLSHVVTKLTK-RNS S IIJ^ 
NSIiMVFVCFG I VTL I VTQPMQLCVGIjTQESCQT - TYQAVMVWCM- 1 SMHSTLQLS I KVH 
LLI - SNCLSFTIL.IVL.VSLMANK-CRIIiIMFHSNIiLRVLiHDAI -VVLF^ 

MHII FliLDLAYGFTNKTLILITCGIHIjPGYRV-KMWLIMLIilKDTLimDTPAKHIiFPSLI 

MLFTQR-MVLKromSLKIRQHFLIJilLHIiSFGLSWLN^ 
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KEKPQHMYLQ-VSAQ-LTLPKNLLRVLVLHDLSCDMVEWKDR-TFLE^ QKVQS 

KV-HLQRDQHKLASMESH-LENQ-KHSIiTTLRK-TALFNSCLKPTLLRAET-RZLSPDHK 

WKLTFSSSLWMNSYSDISSRAMPSNTSFMEISVMDNLAVFI A-PSAHKIHHLN-RIL 

SLWTAQ-KITS-QMRKQVHQWCVL-LIFYLMTLSR--SHKICQ-FQKWSRLQLTMLKFH 

SCFGVRMDMLKPSTQKnfKQVERGNQVLRCLTCTRCKECFLKSWFRIlWKMLLYQKE 

MSQSILNCV]^-IHLL.-LYPTT-ELFTLVIJU^IKEL.HQVQI.CSDNGCQLAHYLSIQILMT 
SSPTHILL-LETVQQYIRLINGTLLLAICMTIXSPNM-QKRMTLKKGFSLICVDL-SKN-P 
WVVL-L-R-QSIIX3MDTFTSLWAISHGGQLLLQM-]smHHRKHF-LGLTILASRRNKL]y^ 
PCMIiTTFSGGTQILSSCLPIHSLT-ANFLLN-EEDL-CLLRRIKSMI-FILFWKKVGLSL 
EKTTELWFQVIFLLTTKRTCLFSYYFLLSLVWTLTGAPLIiMMFKLLITIiNILHL-G^ 
imKFLDQTLFI -LRI YFFHFILMLQGFILLI IRLATLSYL.LRMVFII.LPQRNQ1^ 
FLVLP-TTSHSR-LLLTILLMLLYEHWLNCVTTLSLLFIJJJPWVHRHIL-YSIiy^ 
STYIJyiPFRLMFQKSQVILIWYESLCLiKIKMGFSMFIRAINL-M-FVIYLLVLTL-NLFLS 
CLDVLTLQILEPFLQPFHLLKTFGARQLQPILLAI-SQLHLCSSMVIKWQSQB^ 
HLLNSNALLRALRI,TKEFTRPI,ISGI.FPQEML-DSLILQTCVLLERFIi]y^ 
EKKFLIVLLITLCSTTQHFFQPLSAMAFLPLS -MIFASPMSMQILL- SREMM-DK-RQD 
liVLLIil 1 1 INCQMI SWWSI.LGILGTLMLLQLVI 1 1 INIGILDMASLGPLRETYLMCLS P 
. LMAlSTLAPHIiliL IVIGH-MI]WFTPLLALAT]Snj 

TSVS ILILl^SLVLVC -LLLQRDFNHFMSTLAVMFLI SLI PFEILKHLKY-TFHLi^ 

V-LHLEQMLHLKLLPYIKMLTAIJy[FLQQFMQINSHQLGAYII,LETMYSRI.KQAVL-ELSM 

STLLMSATFLLELAFVIiWIQFLYYVVLAKNLLWLILCIi-VLIVQLLTLITPLIiYLL^ 

I^LQK-CLFLWLKPP-IVICTSAEIDL]WLICFSN]WAFAHN-IVHSQVLL 

KCSLKSrjKCTKPQD-NILVVLIFHKYYLTL-SQLRGLLLRTCSDIR-HSLMDAS-SKnvi^ 

A-VIIJyiLEI SFVRRS SMDLQCCHLCSIJ^EI -LLPTLLL-LWLPLDIXSHL^ 

CKWHIGSMALELPKMFSMRTIOSrKSPTNLTRRLVKF 

THLLNNIiALILVQFQVC-MISFRDLIKSRRRYKLTG-LQADFK^ 

LLIilLLLLKCLSVFLDNQKELTFVERATTLCPSHKQPRMVLSSYMSRMCHPRRGTSPQRQ 
QFVMKAKHTSLVKVFLCIJyiALLGLI.HRGTSFLHK-LLQTIHLSQEIVM 

LCNLSLTHSKKSWTSTSKI IHHQMLILATFQALTIiLSSTFKKKLTASMRSLKI -MNHSLT 
•FKlsnrtTENMSNILNGLGMFGSASLLD-LPSSWLQSCFVA-LWAVASRVHALV^^ 
TLSQFSRVSNYITHKRTYGFVYEIFYSWINYCTASKN-QCFSCKYCSCYSNDTATSDTPF- 
RMACYWRCISCCFSERYQ]SINCAQ-KMAASPL-GLPVHL.QFTAAICYHIiFTSFACRCRYGG 
AIFVPLCLDIFSTMHQRM-lSrYX-EMLALLEVQIQEPITIi-CQLLCIilJUiT-L 
HRYNCRY-R-RHFNTKTQRRLPNWWTDF-G-ALRC-RLCRCTWLFHRSLLPA-WT^ 
WY-KCYILHIi-QAC-RPTECANTHNRRLFRSC-SSNGSNL — ADDDY-RAFVSTRK-VRT 
YVLIRFGRNRYVNS— RTSFSCFRGILASHTSHPYCASIVCVLLQYC-REFSKTNGLRLL 
AC-KSELF-RS S -SSGLNELTI 1 1 ILFGTDTLLIMADNGTITVEELKQLLEQWNLVIGFL 
FLAWIMLLQFAYSNR]SrElFLYIIKLWLWLmPVTLACFVLAAWR 

VGLMWLSYFVASFRLFARTRSMWSFNPETNILLNVPLRGTIVTRPLMESELVIGAVIIRG 
HLRMAGHSLGRCDIKDLPKEITVATSRTLSYYKLGASQRVGTDSGFAAYlSfRYRIGNYKLN 
TDHAGSNDNIAIiLVQ-VTTDVS SC-LPGYNSRDIDYHYEDPQDCYt.ES-RYNKFNSETI I 
-ASN- EELFGVR- -RTYGVRLS IKRT-KLFSS-H-LYLHLASYITIRSVLEVRLYY-KNIi 
AHQEHTRAIHHFTLLLTINLH-LAIJ^TLLLLVIiTVIiDIPISCVQDQFHQNFSSDKRRFN 
KSSTRHFFSLLLL-YF-YFASPLRERQNE-AHFN-IiI.FVL.FSI.SAIPCPNNAYYILVFTR 
NPGSRRTLYQSLNEHETSHCFDIiYFSMQIiHMHCSTALCI~TSCA-RSL-GTTLGVILIA 
LLGFVL-ERFYLFIDGTLWFKHAHmLDSTVKIQDVVRL-LGVGTFMKVTKLLHLETYL 
F-INEQIKMSDNGPQSNQRSAPRITFGGPTDSTDNNQNGGKISrGARPKQRRPQGLPNNTAS 
WFTALTQHGKEELRFPRGQGVPINTNSGPDDQIGYYRRATRRVRGGDGKMKEDSPRWYFY 
YLGTGPEASLPYGANKEGIVWATEGALNTPKDHIGTRNPNlSnSIAATVLQLPQGTTLPKGF 
YAEGSRGGSQASSRSSSRSRGNSRNSTPGSSRGNSPARMASGGGETALALLLLDRLNQLE 
SKVSGKGQQQQGQTVTKKSAAEASKKPRQKRTATKQYNVTQAFGRRGPEQTQGNFGDQDL 
IRQGTDYKHWPQIAQFAPSASAFFGMSRIGMEVTPSGTWLTYHGAIKLDDKDPQFKDNVI 
LLNKHIDAYKTFPPTEPKKDKKKKTDEAQPLPQRQKKQPTVTDLPAADMDDFSRQLQNSM 
SGASADSTQA-TLMVETTQGRWAM-TFSQFRLRYIVYSCAE-ILVTKQHK-V-LTLISHSN 
L-SMCNIREDLKEPPHFHRGHAEYDRGYSE-C-GELPIWKSPNV-N-F-CYPHVILIAS 
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FIGURE 24 

tqekptnldll- icslnel-nlcscrsaaclvhlrs im^^ 

vs svlqs 1 1 s i prfrpgvter-dgepc swcqrentrptqfac p'sg-rrasawlrglcgrg 
piggt-tpqkwhlwssragkeirtapa-tal.cvh-tf-cdkhqsrpqgr-agcrngrhsvr 
s-ryntgstratcgrnpncipqcsss-er — gsrws-lwhrskvl-lr-rawh-sh-rl- 
tkleh-awqwctp-ths-aqwrcshslcrqqflwprwvps-lhqrfsrtrgqvnvhsfrt 
t-lhrveerciillp-p-a-nclvh-al — elrapdtlrn-ecqei-hfqrgmpkvcvss- 
lksqshsttc-kekd-gfhgaytlcvpccistgv-qyalvyldem-slr-sfmadvrl.se 
shl-tlwh-kfsy-rtyymwvpty-ccsenamsclsrprdwt-a-ccrlsqplkh-nstp 

QGR-D -MFWRLC VCLCWLL - - ACLLGS S C - C - YWLRP YWH YW- QCGDLE -GS P -DTE S -T 

c-h-hcwrfsfe-rgchhfgiflcfykcly-hykes-lqvfqnhc-vlr-l-syqgkark 
rclehwtteisfnttvwfsltgcwcyqinfcaht-cskplns-faksschht-wyf-tvi 
tscrrhglyfrpahqqchyygicnwwscttdfsvw-sfghyc-ktqayl-md-get-cr 

S - 1 SQGCLGDSQI SHYRCP -HRQGSNTGCFR- HQGLCKMLH -CC - QGTRNVH - S SHYRWR 

kvaitqlr-slhrskqgtlpsvytwqgaaatthas-gtkrsnls-r-ft-hstyl-ggcs 
qer-trstrdar--lhkwsyrrhtslckwphalrd-gqrtilrivswftgykqclslkrg 
ctn-rcnlwrryclgssrlqecenhi-a — tc-qsa — kvlclhc-iryrsy-vcmccsr 
gccedfttsf-spyqhgy-s — vecsyilli--cw-rklfityvlfllpsr-grrgrcrv 
-grrn — nl-t-vryrg-lsrspsgiwcls-msss-grrrgrlag-yy-aird-artrty 
t-rts-svywlfkty-qcch-mc-hr-ggtkc-sygdckcc-htpetwwwcsrctqqgnq 
wchakge — lh-akwps ysrrvlfafwt- sc-evsaccwt-pkcr-ghpas-gs I -kfqf 

TGHLTCT I WSRH I WC - TT S VFTSVRADG S YTGLYC S Q - QS S L - AGCHGLS — PEA- SGS 
T-TRGATKHRRFQN-GEICRTEACRCEAKN-GLH — GYHNTGRN-VSYQ-VTLVC-YQW- 
ALP-FSEHA-R-RYVFP-EGCTLHGR-CYH-W-YHLCCNTLQKGWWHY-DALKSPEESAS 
— VYNHVPWTRMCWLYT-GS-DCS-EMQICILCTTFRST-C-GRDSRNCILEFERNACSC 

-RDKKINAYMHGC-SHNGNHPT-V-RN-NSRGHR-LWCPILLLY RACSFYYYEAELSK 

-AACHNANWLCDTWF- S -RGCALYAFS - SSCRSVSI ITRCCYYI -WIPHFVIKDI -GALC 
RNSFFGWLLQRLVLFRTAYRVRC- I S-AW-QNCVPHSGEPRRVSS -R-GSFT-QTKESLI 
PAGG-DYKSVHNCGQH-SPHTACGYVYDIWTAVWSNILGWC-CYKN-TSCKS-G-DFLCT 

T HTT — SFRVLPYS — EFSW-VHVCFKPHKEMEISSSWWFNFN-MG--QLLFV-CFI 

STSTA-SQIQCTSTSRGLL-SPCW-CC-LLCTHTRLQ — NCWRAW-CQKNYDPSSTAC-F 
GICKASS -CGV-TLWSENYYLNGCRSCDVYGYS IL — S -DRCFHSMCVWS -CYTI SSTTR 
VPPCYDVCTTC - V- ITARYI LMCE- VHW-LSVWSLHS YNC -GDPLS Y-RS SPYKDVRVQR 
TSD-CFLQGNILHYNHQACW-TRWSYLHRD-TKIGWVL-KG-CLLYRAAYRPCTNSTIT 

KCEF FQTHMF-HKIC FKSNDRLHKASFTRAICHILPRLEWRCSGY-L-TLFSEFQE 

RC - ITA- ANCLAH- PGYNQDNVQTKHLVPTLSLEYKASRYFKF I - S SGSRRHTRNGQSCL 
-KSTTHL-RSSGKSYHTEGSHRV-RENYRSCRQCHT-TIR-RC-SNTRVRS-GSYGCLCG 
KHKHYH-ET--AFTSLRFKNNCHSWYCCN — CSLE-NFGLCQTILRTSSNYNIKLR-EIS 
TTCV- QL YALCVYI IVP IVYF Y-KYQF -N- SFTTYNYC -K- C -ECC - IMFGCRH - LCEVT 
QIF- IVHNRYVAIVVKYLLRFSNLCNCCFWCTLI -FWCSFLL-WR-RIVS -FV-RYYYGF 
L-RFFSLQHLFKWIRLP-FLSSS-NHSGDDFIVQARLDNFRSGR-VGFGIYWHKILLFI 
RSFSYNAGVLWLFC-SFHQQFLAHWYH-YCTNGTRFCNG-DVHLLCFFLLHMEELCSYH 
GWLHLFDLHDVL-AQSCHTR-VYNYC-WHEEIFLCLCKWRPWLLQDSQLEDSQL-HILHW 

- YIH SCS -FVTPV-KTNQP Y- PVI VYC — CCCEKWRASPLL - QGWSKDL-ETSALPF 

CQFRQFES-QH-RFTAY-CHSF -WQVQMRRVCF- VCFCVLQSADVPTYSVA- PS sc irrw 

r-y-sfr-dv-clcrhlfsnf-csygkt-gtccyssqrvskgcsfrwcpfyirvscptrc 
c-yrc-hkgcy-msqtftsl-lrsdr-ql-qfhahl — g-khdaqrswrmy-l-ckayqc 
psskksqcfthleckrlhvfi-taa-tns-ccqeeqhtf-tnlcyn-tgcqchny-nltq 
gw-dc-ylf-tya-ghiivrsccigllyryastyivnp-wlhk-nhwlqshsgwchs-hh 
FY — lfck-tcwf-kmv-pawwfiqk-qklpcsscyhykrdwfhsawltgycaesnqw-l 
lafstscp-ccwqhllhtfqth-v — pcylclrscc-vynp-gcygqtcaill-h-parg 
FYFL — ASSRHSLCAYGWFHHTVS -HLPGGFC - S SNNF-C -vl-twymrkvrsrylpi yq 
w-MGS alqssirsflwc-cdeshs-hlysscatcgcfrcvcfssgwwyychigdlcc 

LLLYEIQTCFW-VQPCCCC-CTFVFDVFHYTLSGTSLQLSAGSLLSLLLVLiDILFHQ-CF 
ILGSPSMVCHVFSYCAFLDNSNLCILYFSEALPLVL-QLS -ERSHV-WS YI - ylrggcfv 
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YLFAQQGWPKIA-RDTVATYTV-QVSCSI-QVQVFQWSLRYYQLS-SSL.LPLSKGSK-Ii 
-QLRC-CSLPTTTDINHFCCSAEWF-ENGIPVRQS-RWGTSNLlAnsrYNS-WIWG-HSIIj 
SKTCHL.HSRRHA-S-L-RSAHSQIQP-L.SCSGWQCSTSCYWPFYAKLSA-A-S-YF-P-D 
TQV-ICPYPTWSNIFSSSMLQWFTIWCIiSVCHET-SYH-RFFP-WIMW-CWF-H-L-LRV 
FLLYASYGASNRSTRWY-L.RR-ILWSIC-QTNCTGCRYRHMHNIKCFGMAVCCCYQW — V 
VS — IHHYFE-L-PCG3SrEVQL-TFDTRSC-HIGTSFCSmU^CRLRYVCCFERAAAEWYEW 
SYYPW-HYFRR-VYTI-CC-TMLWCYL.PR-VQENC-GHSSIiDAFNFLDITIDSCSKYTW 
TWLCLRECFIiAIYSWYYGNCCMCYAAC-A-ARILVLVSVTFSCNSCLL-YGLHAC-LGD 
AYHDMA-IG-H-LVWL-A-GLCYVCFSFSFAYSHDSSHCL — CC-TCLDTDECHYTCLQS 
liLW-CFRSSYFWGLSYFCNL-LFWCRYDYHVFS-SYSVCVC-VLPIVIYYWQHLTVYHA 
CLLFLRLLLLLLLWPFLiFTQPLLiQAYSWCIj-IiIiGL.YTRI-VYELPGAFAS-E-Y-CFQA- 
H-WGYWR-TMYQGCYCW-NV-RKVHICGTALGSSTT-SRVIF-IVGTMCTTPQ-YSSC 
KRHN-SFREDGFSFVCFAIHAGCCRH — WRGNAR-PCYSSGYCFRI -FFTI ICRL.CHCP 
GGL-AGCS-W-F-SRSQKVKEIFECG-I-V-P-CCHATQVGKDGRSGYDPNVQTGKI-GQ 

EGKSN-CYANNALHYA-EA CT-QHYQQCA-WLCSTQHHTIDYSSQTHGCCP-LWYLQ 

EHL-W-HLYICICTLGNPASC-CG-QDCST — N-HGQFTKFGLASYCYSSKSQLSC-TTE 

TESSSTTTDVLCGWYHTNSLY QCTCLL-QFEGR-VCAGITIRPPRSQMG-IP-E-W 

YRYNLHRTGTTL-VCYRHTKRA-SEILVLHQRLKQPK-RYGAGQFSCYSTSSGWKCYRST 
CQFNCAFLLCFCSRPC - S I -GLPSKWRTTNHQLCEDWYTHWYRTGNYCNTRS -HGPRVIi 
WWCFMLSVL-MPH-PSKS-RIL-LER-VRPISIT-YHLC — PSGFYT-KHSLYRIiKNVERLWIi 
-L-PTPRTLiDAVCGCINVFKRVCGVSAARIiTPCGTGTSTDWYRAFDIYNEKVAGFAKFLj. 
KTNCCRFQEKDEEG]SnLLDSYFVVKRHTMSITirQHEETIYNI.VKDCP 
MVPHISRQRLTKYTMADIiWALRHFDEGNCDTLKEILVTYNCCDDDYFlS^^ 
DILRVYANLGERWQSIiLKTVQFCDAMRDAGIVGVLTLDNQDLNGlWirDFGD 
GVPIVDSYYSLl^PIIiTLTRAIJU^SHMDADLAKPLIKWDLLKYDFTEERLCLFDRYFKY. 
■ WDQT YHPNC INCLDDRC ILHCANFNVLF STVF PPTS FGPLVRKI FVDGVPFWS TG YHFR 
ELGVVHNQDVNLHSSRLSFKELIiWAADPAMHAASGNLLLDKRTTCFSV 
WPGNFNKDFYDFAVSKGFFKEGSSVELKHFFFAQDGNAAISDYDYYRYNLPTMCDIRQL. 
LFVVEVVDKYFDCYDGGCINANQVIVMNLDKSAGFPFNKWGKARLYYDSMSYEDQD 
YTKRNVI PTITQMNLKYAI SAKNRARTVAGVS ICSTMTNRQFHQKLLKS lAATRGATWI 
GTSKFYGGWHNMLKTVYSDVETPHLMGWDYPKCDRAMPNMLRIMA 

HRFYRLANECAQVTjSEMVMCGGSLYVKPGGTSSGDATTAYANSWNICQAVTAIWNAL^ 
TDGNKIADKYVRNLQHRLYECLYRimDVDHEPVDEFYAYLRKHFSlXM 

YAAQGIiVASIKNFKAVLYYQNlWFMSEAKCWTETDIiTKGPHEFCSQHTMLVKQGDDYVYL 

PYPDPSRILGAGCFVDDIVKTDGTLMIERFVSIiAIDAYPLTKHPNQEYADVFHLiYLQYIR 

KLHDELTGHMLDMYS VMLTNDNT SRYWE PEF YE AMYTPHTVLQAVGACVLC 

ACIRRPFLCCKCCYDHVISTSHKLVLSVNPYVCNAPGCDVTDVTQLYLGGMSYYCKSHKP 

PISFPLCANGQWGLYKNTCVGSDWTDFNAIATCDWTNAGDYILANTCTERLKLFAAET 

LKATEETFKLSYGIATVREVLSDREDHLSWEVGKPRPPLmysrYWTGYRVTK^ 

YTFEKGDYGDAWYRGTTTYKLNVGDYFVLTSHTVMPIiSAPTLVPQEHYVRITGLYPTIiN 

ISDEFSSNVANYQKVGMQKYSTLQGPPGTGKSHFAIGLALYYPSARIVYTACSHAAVDAL 

CEKALKYLP IDKC SRI I PARARVECFDKFKVNSTLEQYVFCTVNALPETTADI WFDEI S 

MATNYDLSVWARLRAKHYVYIGDPAQLPAPRTLLTKGTLEPEYFNSVCRLimCTIGP 

LGTCRRCPAEIVDTVSALVYDNKLKAHKDKSAQCFKMFYKGVITHDVSSAINRPQIG^^ 

EFLTRNPAWRKAVFISPYNSQNAVASKILGIiPTQTVDSSQGSEYDYVIFTQTTETAHSCN 

VNRF1WAITRAKIGILCIMSDRDLYDKLQFTSL.EIPRRWATLQAENVTGLFKDCSKIIT 

GLHPTQAPTHL S VD IKFKTEGLC VD I PG I PKDMTYRRL I SMMGFKMNYQVNG YPNMF I TR 

EEAIRHVRAWIGFDVEGCHATRDAVGTNLPLQLGFSTGVNIiVAVPTGYVDTENNTEFTRV 

NAKPPPGDQFKHLIPLMYKGLPV<nWVRIKIVQMLSDTLKGIiSDRVVFVLWAHGFEL^ 

YFVKIGPERTCCLCDiaiATCFSTSSDTYACWNHSVGFDYVYNPFMIDVQQWGFTGlKriiQSN 

HDQHCQVHGNAHVASCDAIMTRCLAVHECFVKRVDWSVEYPIIGDELRVNSACRKVQHMV 

VKSALLADKFPVIiHDIGNPKAIKCVPQAEVEWKFYDAQPCSDKAYKIEELFYSYATHHDK 

FTDGVCLFWNCNVDRYPANAIVCRFDTRVLSNLNLPGCDGGSLYVNKHAFHTPAFDKSAF 

TNLKQIiPFFYYSDSPCESHGKQWSDIDYVPLKSATCITRCNIiGGAVCRHHANEYRQYLD 

AYNMMI SAGFSLWI YKQFDTYNLWNTFTRLQSLENVAYNVVNKGHFDGHAGEAPVS I INN 

AWTKVDGIDVEIFENKTTLPVNVAFEIiWAKRNIKPVPEIKIIJjaNLGVDIAANTVi;^ 
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REAPAHVSTIGVCTMTDIAIOCPTESACSSLTVIiFDGRVEGQVDLFiysrARNGVL 
GliTPSKGPAQASWGWIjIGESVKTQFNYFKECVIDGIIQQLPETYFTQSRDLEDFKPRSQM 
ETDFLELAimDEFIQRYKLEGYAFEHIVYGDFSHGQLGGLHIiMIGLAKRSQDSPL.KLEDFI 
PMDSTVKNYF I TDAQTGS SKCVC SVIDLLLDDFVEI IKSQDLSVI SKWKVTIDYAE I SF 
MLWCKDGHVETFYPKLQASRAWQPGVAMPNLYKMQRMLIjEKCDLQlTyGENAVIPKGIt^^ 
VAKYTQIiCQYLNTLTIjAVPYlsnmiVIHFGAGSDKGVAPGTAVriRQWIjPTGTLiIjV^ 
VSDAYSTLIGDCATVHTAKTECWDIillSDMYDPRTKHVTKEiroSKEGFFTYLCGFIKQI^^ 
GGSIAVKITEHSWNADLYKLMGHFSWWTAFVTNVNASSSEAFLIGAlSrYLGK^ 
MHAJSryiFWRlsrTNPIQDSSYSriFDMSKFPDKLiRGTAVMSIjKENQIlSroMIYSIiliE 
EMNRVWS SD I LVMN- TNMF I PLLPLTLTSGSDLDRCTTPDDVQAPNYTQHTS SMRGVYY 
PDE I FRSDTLj YLTQDLFLPF YSNVTGFHT INHTFGNPVI PFKDG I YFAATEKSNWRGWV 
FGSTMNNKSQS VI I INNSTJWVIRACr^ 

Y I SDAF SLDVSEKSGNFKHLREFVFKMraDGFLjYVYKGYQP IDVVRDIjPSGFNTL^ 

PLGINITNFRAHjTAFSPAQDIWGTSAAAYFVGYLKPTTFMLKYDENGTITDAVDCSQNP 

LAELKCSWSFEIDKGIYQTSNFRVVPSGDVVRFPNITNLCPFGEVFNATKFPSVYAWER 

KKISNCVADYSVIiYNSTFFSTFKCYGVSATKIjNDriCFSNVYADSFVVKGDDVRQIAPGQT 

GVIADYNYKLPDDFMGCVIuAWNTRNIDATSTGlJaTSIYKYRYLR^ SNVPFS P 

DGKPCTPPALNCYWPLNDYGFYTTTGIGYQPYRVWIiSFELLNAPATVCGPKLSTDIjIKN 

QCTNFNFNGLTGTGVLTPSSKRFQPFQQFGRDVSDFTDSVRDPKTSEIIiDISPCAFGGVS 

VITPGTNASSEVAVriYQDWCTDVSTAIHADQLTPAWRIYSTGNNVFQTQAGCLIGAEHV 

DT S YECD I P I GAG I C AS YHTVSIiLRSTSQKS I VAYTMSLGADS S I AYSNNT I AI PTNF S I 

SITTEVMPVSMAKTSVDC]SnX[YICGDSTECA]SrLLIjQYGSFCTQLK^^ 

WAQVKQMYKTPTLKYFGGFNFSQILPDPLiKPTKRSFIEDLLFinCVTLADAGFMKQYGEC 

LGDINARDLICAQKFNGLTVLPPLLTDDMIAAYTAALVSGTATAGWTFGAGAALQIPFAM 

QMAYRFNGIGVTQWLYENQKQIANQFNKAISQIQESLTTTSTAIjGKLQDVVNQNAQAIjN 

TLiVKQLSSNFGAISSVLNDILSRLDKVEAEVQIDRIilTGRLQSLQTYVTQQIilRAAEIRA 

SAmiAATKMSECVLGQSKRVDFCGKGYHLMSFPQAAPHGWFLHVTYVPSQERl^ 

ICHEGKAYFPREGVFVFNGTSWFITQRNFFSPQI ITTDNTFVSGNCDWIGI INNTVYDP 

LQPELDSPKEELDKYFKNHTSPDVDLGDISGINASVVNIQKEIDRLNEVAKNLNESIilDL 

QEIjGKYEQYIKWPWYWLGFIAGLIAIVMVTILLCCMTSCCSCIiKGACSCGSCCKFDEDD 

S E PVLKGVKLH YT - TNLWI C L -DFLLLDQLLHS Q - KLTMLLLQVLFMLQQRYRYKPHS L S 

DGLLLALHFLLFFRALjPK- LRS IKDGS - PFIRASS SFAI YCC YLL.PS IHIFCLSLQVWRR 

NFCTSMP-YIFYNASTHVELIj-DVGFVGSANPRTHYFMMPTTIjFAGTHITMTT^ 

QIQLSLLKVTAFQHQNSKKTTKIiWILRIGTQVIiKTMSIiYMAiSPKFTTSLSIjHKL.IiQTIi 

VLKMLHSSSLTSIiLKTHRMCKYTQSTAIjQELLIQQWIQFMMSRRRIiIjACLCKHKK^ 

CTHSFRKKQVR LIAYPFFIiLSWYSC-SH-PSLLRFDCVRTAAILLT-V NQRFTSTR 

VLKI-TLLKEFLIFWSKRTNYYYYSVWNFNIAYHGRQRYYYR-GA-TTPGTMEPSN^ 
PSLDYVTTICLF-SEQVFVHNKACFPI^LVASm'CLFCACCCLQN-LGDWRDCDCNGLYC 
RLDVA- LLRC FLQAVC S YPLNWI QPRNKH S S QC AS PGDNCDQTAHGK - TCHWCCDHS WS 
LANGRTLPRAIj-H-GPAKElDHCGYITNAFIiIjQIRSVAACRH-FRFCCIQPIjPYWKIj--IKY 
RPRR-QRQYCFASTVSDNRCFILiL.TSRLQ-QRY-LSL-GIiSGLLFGIIjTL — VQ — DNYL 
SL-LRRI IRS -MMKNLWS - 1 IHKTNMKI ILFLTLiIVFTSCEIjYHYQECVRGTTVXiLKEPC 
PSGT YEGNS PFHPLADNKFALTCTSTHFAFACADGTRHTYQIiRARSVS PKLF IRQEEVQQ 

ELYSPLFIiIVAAIiVFIiIIiCFTIKRKTE-MSSL.-LTSICAF-PFCYSLF CLLYFGFHSK 

SRI-KNIiVPKSKRT-lSrPSL.F-IiVFLYAVAYAL-YSAVHL.INIiMCLKILVRYNTRGNTYST 
AWIjCALjGKVIjPFHRWHTMVQTCTP]WTINCQDPAGGALIARCWYL.HEGHQTAAFRDVIjW 
LNKRTN-NV — WTP I KPT -CP PHY I WWTHRFN- Q - PEWRTQWGKAKTAPTPRFTQ - YCVL 
VHS SHSAWQGGT- 1 P SRPGRSNQHQ- WSR- PNWDLPKS YPTS SWW-RQNERAQPQMVLLL 
PRNWPRSFTSLRR-QRRHRMGCN-GSLEYTQRPHWHPQS — QCCHRATTSSRNNIAKRLL 
RRGKQRRQSSLFSLLIT-SR-FKKFNSWQQ-GKFSCSNG-RRW-NCPRAIAARQIEPA-E 
QSFW-RPTTTRPNCH-EICC-GI-KASPKTYCHKTVQRHSSIWETWSRTNPRKFRGPRPN 
QTRM-LQTLJUy^JCTICSKGLCILWWTHWHGSHTFGNM^ 

AEQAH-RIQNIPTNRA-KGQKEKD — SSAFAAETKEAAHCDSSSCG-HG-FLiQTTSKFHE 

WSFC-FNSGIKTTHDDHTRQMGYVNVFAIPFTIHSLIiIiCKMNSKN-TAQVGL 

LIW-H-GGLERATTFSSRPRGWSRVQ-Il^GRAAYMEEP-CVKLIIiVVIjSPCDFNSFIi 
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FIGURE 25 

PTPRTLDAVCGCINVFKRVCGVSAARLTPCGTGTSTDWYRAFDIYNEKVAGFAKFLKTNCCRFQE 

lODEEGNLLDSYFVVKRHTMSKryQHEETIYOTiVKDCPAVAVHDFFKFRVDGDMVPHISRQ 

ADLVYALRHFDEGNCDTLKEILWYNCCDDDYFNKKDWYDFVENPDILRVYANLGERVRQSLLKTV 

QFCDAMRDAGIVGVLTLDNQDLNGNWYDFGDFVQVAPGCGVPrVDSYYSLLMPILTLTRALAAESH 

MDADI^PLIKWDLLKYDFTEERLCLFDRYFKYWDQTYHPNCINCLDDRCILHCANFNVLFSTVFP 

PTSFGPLVRKIFVDGVPFWSTGYHFRELGVVHNQDVNLHSSRLSFJi:ELLVYAADPAMHAASGNLL 

LDKRTTCFSVAALTNNVAFQTWPGNF]m)FYDFAVSKGFFKEGSSVELKHFFFAQDGNAAISDYD 

YYRYNLPTMCDIRQLLFVVEVVDKYFDCYDGGCINANQVIVNNLDKSAGFPFNKWGKARLYYDSMS 

YEDQDALFAYTKKWIPTITQMNLKYAISAKimARTVAGVSICSTMTNRQFH^ 

WIGTSKFYGGWHimLKTVYSDVETPHLMGWDYPKCDRAMPNMIiRIMASLVLARK^^ 

YRIJVNECAQVLSEMVMCGGSLYVKPGGTSSGDATTAYANSVFNICQAWAN^ 

YVRNLQHRLYECLYRNRDVDHEFVDEFYAYLRKHFSiyMILSDDAWCYNSK^ 

VLYYQNlSrVFMSEAKCWTETDLTKGPHEFCSQHTJyn^VKQGDDYVYL 

DGTmiERFVSLAIDAYPLTKHPNQEYADVFHLYLQYIRKLHDELTGHMLDMYSVMLTNDNTSRYW 

EPEFYEAMYTPHTVLQAVGACVLCNSQTSLRCGACIRRPFLCCKCCYDHVISTSHKLVLSVNPYVC 

NAPGCDVTDVTQDYLGGMSYYCKSHKPPISFPLCANGQVFGLYKNTCVGSDNVTDFNAIATCDWTN 

AGDYILANTCTERLKLFAAETLKATEETFKLSYGIATVREVLSDRELHLSWEVGKPRPPLNRNYVF 

TGYRWKNSKVQIGEYTFEKGDYGDAVWRGTTTYKLNVGDYFVLTSHTVMPLSAPTLVPQEHYVR 

ITGLYPTLNI SDEFS SNVANYQKVGMQKYSTLQGPPGTGKSHFAIGLALYYPSARI VYTACSHAAV 

DALCEKALKYLPIDKCSRIIPARARVECFDKFK\mSTLEQYVFCTVNALPETTADIWFDEISMAT 

ISryDLSVVNARLRAKHYWIGDPAQLPAPRTLLTKGTLEPEYFNSVCRIiMKTIGPDMFLGT^ 

EIVDWSALVYDNKLKAHKDKSAQCFKMFYKGVITHDVSSAINRPQIGVVREFLTRNPAWRKAVFI 

SPYNSQNAVASKILGLPTQTVDSSQGSEYDYVIFTQTTETAHSCIWNRFNVAITRAKIGILCIMSD 

RDLYDKLQFTSLEIPRIUWATLQAENVTGLFKDCSKIITGLHPTQAPTHLSVDIKFKTEGLCVDIP 

GIPKDMTYRRLISMMGFKMNYQWGYPIMFITREEAIRWRAWIGFDVEGCHATRDAVGTN^ 

GFSTGVlvnijVAVPTGYVDTElSINTEFTRWAKPPPGDQFKHLIPLMYKGLPWNVVRIKIVQJ^ 

GLSDRVVFVLWAHGFELTSMKYFWIGPERTCCLCDKRATCFSTSSDTYAC\AnKrHSVGFDYVYNPFM 

IDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFVKRVDWSVEYPIIGDELRVNS'- 

ACRKVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQAEVEWKFYDAQPCSDKAYKIEELFYSYATH 

HDKFTDGVCLFWNasrVT)RYPANAIVCRFDTRVLSNIJSn:iPGCDGGSLYVIIK^ 

KQLPFFYYSDSPCESHGKQWSDIDYVPLKSATCITRCNLGGAVCRHHANEYRQYLDAYIOMMISAG 
FSLWIYKQFDTYNLWNTFTRLQSLEWAYlSrWNKGHFIXSHA 

NKTTLPVNVAFELWAKRNIKFVPEIKILl^GVDIAANWIWDYKREAPAHVSTIGVCTM 

PTESACSSLTVLFDGRVEGQVDLFRNARNGVLITEGSVKGLTPSKGPAQASVNGVTLIGESVKTQF 

NYFKKVDGIIQQLPETYFTQSRDLEDFKPRSQMETDFLELAMDEFIQRYKLEGYAFEHIVYGDFSH 

GQLGGLHLMIGLAKRSQDSPLKLEDFIPMDSTVKNYFITDAQTGSSKCVCSVIDLLLDDFVEIIKS 

QDLSVISKVVKVTIDYAEISFMLWCKDGHVETFYPKLQASRAWQPGVAMPNLYKMQRMLLEKCDLQ 

]SnfGENAVIPKGIMMIWAKYTQLCQYLNTLTLAVPYNMRVIHFGAGSDKGVAPGTAVIiR 

LVDSDLITOFVSDAYSTLIGDCATVHTANKWDLIISDimJPRTKHVTKENDSKEGFFTYLCGFIKQK 

LALGGSIAVKITEHSlAnsrADLYKLMGHFSWWTAFVTNVNASSSEAFLIGANYLGKPKEQIDGYTMHA 

]SnfIFWRNTNPIQLSSYSLFDMSKFPLKLRGTAVMSLKENQINDMIYSLLEKGRLIIRENNRVVVSS 

DILVNNJJl^TNMFIFLLFLTLTSGSDLDRCTTFDDVQAPNYTQHTSSMRGVYYPDEIFRSDT 

LFLPFYSIWTGFHTINHTFGNPVIPFKDGIYFAATEKSIWVRGWFGSTMNNKSQSVIIIN^ 

VIRACNFELCDNPFFAVSKPMGTQTHTMIFDNAFNCTFEYISDAFSLDVSEKSGNFKHLREFVFKN 

KDGFLYVYKGYQPIDWRDLPSGFNTLKPIFKLPLGINITNFRAILTAFSPAQDIWGTSAAAYFVG 

YLKPTTFMLKYDENGTITDAVDCSQNPLAELKCSVKSFEIDKGIYQTSNFRWPSGDWRFPNITN 

LCPFGEWNATKFPSWAWERKKISNCVADYSVLYNSTFFSTFKCYGVSATKLNDLCFSNVYADSF 

VVKGDDVRQIAPGQTGVIADYTSHirKLPDDFMGCVLAWNTRNIDATSTGimTrKYRYLRHGKLRPFE 

DISNVPFSPDGKPCTPPALNCYWPLNDYGFYTTTGIGYQPYRVVVI.SFELLNAPATVCGPKLSTDL 

IKNQCVNFNFNGLTGTGVLTPSSKRFQPFQQFGKDVSDFTDSVRDPKTSEILDISPCAFGGVSVIT 

PGTNASSEVAVLYQDVNCTDVSTAIHADQLTPAWRIYSTGNNVFQTQAGCLIGAEHVDTSYECDIP 

IGAGICASYHTVSLLRSTSQKS I VAYTMSLGADS S I AYSNNTI AI PTNFS I S ITTEVMPVSMAKTS 

VDOKIMYICGDSTECANLLLQYGSFCTQLNRALSGIAAEQDRNTREWAQVKQMYKTPTLKYFGGFN 

FSQILPDPLKPTKRSFIEDLLFNKVTLADAGFMKQYGECLGDINARDLICAQKFNGLTVLPPLLTD 
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DMIAAYTAALVSGTATAGWTFGAGAALQIPFJy^QMAYRFNGIGWQlWLYENQKQIANQFISrK^ 

IQESLTTTSTALGKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDILSRLDKVEAEVQIDRLITG 

RLiQSLQTYVTQQLIRAAEIRASANIJ^TKMSECVTiGQSKRVDFCG 

TWPSQEKI^TTAPAICHEGKAYFPREGVFVFNGTSWFITQRNFFSPQIITTDNTFVSGNCDWIG 
I INNTVYDPLQPELDSFKEELDKYFKNHTS PDVDLGDI SGINASWNIQKE IDKLNEVAKNIiNESL 
IDLQELGKYEQYIKWPWYVWLGFIAGLIAIVMVTILLCCMTSCCSCLKGACSCGSCCKFDEDDSEP 
VLKGVKLHYT* 
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FIGURE 27 
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FIGURE 29 
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FIGURE 33 
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FIGURE 35 




FIGURE 36 

FIGURE 36A 
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FiGURE 37 
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FIGURE 38 
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FIGURE 40 
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FIGURE 41 
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FIGURE 43 
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FIGURE 44 
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FIGURE 43 
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FIGURE 50 
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FIGURE 52 
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FIGURE 54 
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FIGURE 57 
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FIGURE 60 
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FIGURE 62 




Xbal 3109 bp Pcil PcH 457 bp Sail 



ligate Xbal-Pdl 3109 bp+ PciI-SalI457 bp 
fragments into p893-l Xbal-Sall vector 



wo 2004/092360 



PCT/US2004/011710 



111/193 

FIGURE 63 
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FIGURE 65 

1 10 
MSDLDRCTTF 
AAGCTTACAAAACAAA ATG AGT GAG CTT GAG CGG TGC ACC ACT TTT 



DDVQAPNY 
GAT GAT GTT CAA GOT CCT AAT TAG 

30 

RGVYYPDE 
AGG GGG GTT TAG TAT CCT GAT GAA 



YLTQDLFL 
TAT TTA ACT CAG GAT TTA TTT CTT 

60 

GFHTINHT 
GGG TTT CAT ACT ATT AAT CAT ACG 



FKDG IYFA 
TTT AAG GAT GGT ATT TAT TTT GCT 

90 

VRGWVFGS 
GTC CGT GGT TGG GTT TTT GGT TCT 



SVI I INNS 
TCG GTG. ATT ATT ATT AAC AAT TCT 
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TGT AAC TTT GAA TTG TGT GAC AAC 



PMG T QTH T 
CCC ATG GGT ACA CAG ACA CAT ACT 
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AAT TGC ACT TTG GAG TAG ATA TCT 



SEKSGNFK 
TCA GAA AAG TCA GGT AAT TTT AAA 
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P F Y S N V T 
CCA TTT TAT TCT AAT GTT ACA 
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F G N P V I P 
TTT GGG AAC CCT GTC ATA CCT 
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A T E K S N V 
GCC ACA GAG AAA TCA AAT GTT 
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T M N N K S Q 
ACC ATG AAC AAC AAG TCA CAG 

110 

T N V V I R A 
ACT AAT GTT GTT ATA CGA GCA 

130 

P F F A V S K 
CCT TTG TTT GCT GTT TCT AAA 

140 

M I F D N A F 
ATG ATA TTG GAT AAT GCA TTT 
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D A F S L D V 
GAT GCC TTT TCG CTT GAT GTT 

170 

H L R E F V F 
CAC TTA CGA GAG TTT GTG TTT 



180 190 
KNKDGFLYVYKGYQP 
AAA AAT AAA GAT GGG TTT CTC TAT GTT TAT AAG GGC TAT CAA CCT 
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200 

IDVVRDLPSGFNTLK 
ATA GAT GTA GTT CGT GAT CTA CCT TCT GGT TTT AAC ACT TTG AAA 
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CCT ATT TTT AAG TTG CCT CTT GGT ATT AAC ATT ACA AAT TTT AGA 
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A I L T A F S P A Q D I W G T' 
GCC ATT CTT ACA GCC TTT TCA CCT GCT CAA GAC ATT TGG GGC ACG 
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TCA GCT GCA <3CC TAT TTT GTT GGC TAT TTA AAG CCA ACT ACA TTT 
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ATG CTC AAG TAT GAT GSAA AAT GGT ACA ATC ACA GAT GCT GTT GAT 
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360 



37G 



GVSATKLNDLCF SNV 
GGC GTT TCT GCC ACT AAG TTG AAT GAT CTT TGC TTC TCC AAT GTC 

380 

YADSFVVKGDDVRQI 
TAT GCA GAT TCT TTT GTA GTC AAG GGA GAT GAT GTA AGA CAA ATA 



390 



400 



APG QTGVIADYNYKL 
GCG CCA GGG CAA ACT GGT GTT ATT GCT GAT TAT AAT TAT AAA TTG 
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P D D F M G ' C 
CCA GAT GAT TTC ATG GGT TGT 

420 

I D A T S T G 
ATT- GAT GCT ACT TCA ACT GGT 



L R H G K L R 

CTT AGA CAT GGC AAG CTT AGG 

450 

V P F S P D G 

.GTG CCT TTC TCC CCT GAT GGC 



N C Y W P L N 
AAT TGT TAT TGG CCA TTA AAT 

480 

G I G y Q P Y 
GGC ATT GGC TAC CAA CCT .TAC 



L L N A P A T 
CTT TTA AAT GCA CCG GCC ACG 

510 

D L I K N Q C 
GAC CTT ATT AAG AAC CAG TGT 



T G T G V li T 
ACT GGT ACT GGT GTG TTA ACT 

540 

P Q Q F G R D 
TTT CAA CAA TTT GGC CGT GAT 



R D P K T S E 
CGA GAT CCT AAA ACA TCT GAA 

570 

F G G V S V I 
TTT GGG GGT GTA AGT GTA ATT 



E V A V L Y Q 
GAA GTT GCT GTT CTA TAT CAA 

600 

T A I H A D Q 
ACA GCA ATT CAT GCA GAT CAA 



410 

VLAWN TRN 
GTC CTT GCT TGG AAT ACT AGG AAC 

430 

NYNYKYRY 
AAT TAT AAT TAT AAA TAT AGG TAT 

440 

PFERDISN- 
CCC TTT GAG AGA GAC ATA TCT AAT 

460 

KPCTPPAL 
AAA CCT TGC ACC CCA CCT GCT CTT 

470 

DYGPYTTT 
GAT TAT GGT TTT TAC ACC ACT ACT 

490 

RVVVLSFE 
AGA GTT GTA GTA CTT TCT TTT GAA 

500 

VCGPKLST 
GTT TGT GGA CCA AAA TTA TCC ACT 

520 

VNFNFNGL 
GTC T^T TTT AAT TTT AAT GGA CTC 

530 

PSSKRFQP 
CCT TCT TCA AAG AGA TTT CAA CCA 

550 

VSDFTDSV 
GTT TCT GAT TTC ACT GAT TCC GTT 

560 

I LD IS PC S 
ATA TTA GAC ATT TCA CCT TGC TCT 

580 

TPG TNASS 
ACA CCT GGA ACA AAT GCT TCA TCT 

590 

DVNCTDVS 
GAT GTT AAC TGC ACT GAT GTT TCT 

610 

LTPAWRIY 
CTC ACA CCA GCT TGG CGC ATA TAT 
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620 

STGNNVFQTQAG CLI 
TCT ACT GGA AAC AAT GTA TTC CAG ACT CAA GCA GGC TGT CTT ATA 

630 640 
GAEHVDT SYECDIPI 
GGA GCT GAG CAT GTT GAT ACT TCT TAT GAG TGC GAC ATT OCT ATT 

650 

G. A GICASYHTOC SEQ ID NOs 9T99 

GGA GCT GGC ATT TGT GCT AGT TAG CAT ACA TAA TGAGTCGAC SEQ ID NO: 9800 

Translated Mol. Weight = 72525.52 

FIGURE 66 

1 10 
MSDLDRCTTF 
AAGCTTACAAAACAAA ATG AGT GAC CTT GAC CGG TGC ACC ACT TTT 

20 

DDVQAPNYTQHTSSM 
GAT GAT GTT CAA GCT CCT AAT TAC ACT CAA CAT ACT TCA TCT ATG 

30 40 
RGVYYP DEIPRSDTL 
AGG GGG GTT TAC TAT CCT GAT GAA ATT TTT AGA TCA GAC ACT CTT 

50 

YiT QD.LFI, PFYSN VT 
TAT TTA ACT CAG GAT TTA TTT CTT CCA TTT TAT TCT AAT GTT ACA 

60 70 
GFHTINHTFGNPVIP 
GGG TTT CAT ACT ATT AAT CAT ACG TTT GGC AAC CCT .GTC ATA CCT 

80 

FKDGIYFAATEKSN V 
TTT AAG GAT GGT ATT TAT TTT GCT GCC ACA GAG AAA TCA AAT GTT 



90 



100 



VRGWVFGST MNNKSQ 
GTC CGT GGT TGG GTT TTT GGT TCT ACC ATG AAC AAC AAG TCA CAG 

110 

SVIIINNSTNVVIRA 
TCG GTG ATT ATT ATT AAC AAT TCT ACT AAT GTT GTT ATA CGA GCA 



120 



130 



CNF ELCDN PF FAVSK 
TGT AAC TTT GAA TTG TGT GAC AAC CCT TTC TTT GCT GTT TCT AAA 

140 

PMG TQTHTMI FDNAF 
CCC ATG GGT ACA CAG ACA CAT ACT ATG ATA TTC GAT AAT GCA TTT 

150 160 
NCTFEYI SDAFSLDV 
AAT TGC ACT TTC GAG TAC ATA TCT GAT GCC TTT TCG CTT GAT GTT 



BNSDOCID: <WO__20040923eCIA2J_> 



wo 2004/092360 PCT/US2004/011710 

116/193 

170 

SEKSGNPKHLREFVP 
TCA GAA AAG TCA QGT AAT TTT AAA CAC TTA CGA GAG TTT GTG TTT 

180 190 
KNKDGFLYVYKGYQP 
AAA AAT AAA GAT GGG TTT CTC TAT GTT TAT AAG GGC TAT CAA CCT 

200 

IDVVRDLPSGPNTLK 
ATA GAT GTA GTT CGT GAT CTA CCT TCT GGT TTT AAC ACT TTG AAA - 

210 220 
PIFKLPLGINIT NFR 
CCT ATT TTT AAG TTG CCT CTT GGT ATT AAC ATT ACA AAT TTT AGA 

230 

AILTAPSPAQDIWGT 
GCC ATT CTT ACA GCC TTT- TCA CCT GCT CAA GAC ATT TGG GGC ACG 

240 250 
SAAAYFVGYLKPT TF 
TCA GCT GCA GCC TAT TTT GTT GGC TAT TTA AAG CCA ACT ACA TTT 

260 

M LKYDENGT ITDAVD 
ATG CTC AAG TAT GAT GAA AAT GGT ACA ATC ACA GAT GCT GTT GAT 

270 280 
CSQNPIiAELKCSVKS 
TGT TCT CAA AAT CCA CTT GCT GAA CTC AAA TGC TCT GTT AAG AGC 

290 

FE. IDKG IYQ TSNFRV 
TTT GAG ATT GAC AAA GGA ATT TAC CAG ACC TCT AAT TTC AGG GTT 

300 310 
VPSGDVVRFPNITNIi 
GTT CCC TCA GGA GAT GTT GTG AGA TTC CCT AAT ATT ACA AAC TTG 

320 

C PFGEVFNAT.KF P SV 
TGT CCT TTT GGA GAG GTT TTT AAT GCT ACT AAA TTC CCT TCT GTC 

330 340 
YAWERKKI SNCVADY 
TAT GCA TGG GAG AGA AAA AAA ATT TCT AAT TGT GTT GCT GAT TAC 

350 

SVIjYNSTFFSTFKCY 
TCT GTG CTC TAC AAC TCA ACA TTT TTT TCA ACC TTT J\AG TGC TAT 

360 370 
GVSATKLNDLCFSNV 
GGC GTT TCT GCC ACT AAG TTG AAT GAT CTT TGC TTC TCC AAT GTC 

380 

YADSFVVKGDDVRQI 
TAT GCA GAT TCT TTT GTA GTC AAG GGA GAT GAT GTA AGA CAA ATA 

390 400 
A P G Q T GV I A D Y MY K L 
GCG CCA GGG CAA ACT GGT GTT ATT GCT GAT TAT AAT TAT AAA TTG 
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410 

PDDFMGCVLAWNTRN 
CCA GAT GAT TTC ATG GGT TGT GTC CTT GCT TGG AAT ACT AGG AAC 

420 430' 
IDAT STGNYNYKYRY 
ATT GAT GCT ACT TCA ACT GGT AAT TAT AAT TAT AAA TAT AGG TAT 

440 

LRHGKLRPFERDI SN 
CTT AGA CAT GGC AAG CTT AGG CCC TTT GAG AGA GAC ATA TCT AAT 



450 



460 



VPFS PDGKPCTPPAL 
GTG CCT TTC TCC CCT GAT GGC AAA CCT TGC ACC CCA CCT GCT CTT 

470 

NCYWPLNDYGFYTTT 
AAT TGT TAT TGG CCA TTA AAT GAT TAT GGT TTT TAC ACC ACT ACT 



480 



490 



GI.GYQPYRVVVLSFE 
GGC ATT GGC TAC CAA CCT TAC AGA GTT GTA GTA CTT TCT TTT GAA 

500 

LL .NAP ATVCGPKLST 
CTT TTA AAT GCA CCG GCC ACG GTT TGT GGA CCA AAA TTA TCC ACT 



510 



520 



DLIKNQ CVNFNFNGL 
GAC CTT ATT AAG AAC CAG TGT GTC AAT TTT AAT TTT AAT GGA CTC 

530 

TGTG VLTPSSKRFQP 
ACT GGT ACT GGT GTG TTA ACT CCT TCT TCA AAG AGA TTT CAA CCA 



540 



550 



FQQFGRDVSDFTDSV 
TTT CAA CAA TTT GGC CGT GAT GTT TCT GAT TTC ACT GAT TCC GTT 

560 

RDPKTSEILD ISPCS 
CGA GAT CCT AAA ACA TCT GAA ATA TTA GAC ATT TCA CCT TGC TCT 



570 



580 



FGGVSVITPGTNASS 
TTT GGG GGT GTA- AGT GTA ATT ACA CCT GGA ACA AAT GCT TCA TCT 

590 

EVAVLYQDVNCTDVS 
GAA GTT GCT GTT CTA TAT CAA GAT GTT AAC TGC AGT GAT GTT TCT 

600 610 
TAIHADQLTPAWRIY 
ACA GCA ATT CAT GCA GAT CAA CTC ACA CCA GCT TGG CGC ATA TAT 

620 

STGNNVFQTQAGCLI 
TCT ACT GGA AAC AAT GTA TTC CAG ACT CAA GCA GGC TGT CTT ATA 

630 640 
GAEHVDTSYECDIPI 
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GGA GCT GAG CAT GTC GAC ACT TCT 



G AG I C AS Y 
GGA GCT GGC ATT TGT GCT AGT TAC 

660 

STSQKSIV 
AGT ACT AGC CAA AAA TCT ATT GTG 



ADSSIAYS 
GCT GAT AGT TCA ATT GCT TAC TCT 

690 

TNFSISIT 
ACT AAC TTT TCA ATT AGC ATT ACT 



MA KT S VDC 
ATG GCT AAA. ACC TCC GTA GAT TGT 

720 

STECANLL 
TCT ACT GAA TGT GCT AAT TTG CTT 



TQLNRAIiS 
ACA CAA CTA AAT CGT GCA CTC TCA 

750 

RNTREVFA 
CGC AAC ACA CGT GAA GTG TTC GCT 



T P T L K Y F G 
ACC CCA ACT TTG AAA TAT TTT GGT 

780 

IjPDPLKPT 
TTA CCT GAC CCT CTA AAG CCA ACT 



LLFNKVTL 
-TTG CTC TTT AAT AAG GTG ACA CTC 

810 

QYGECLGD 
CAA TAT GGC GAA TGC CTA GGT GAT 



CAQKFNGL 
TGT GCG CAG AAG TTC AAT GGA CTT 

840 

TDDMIAAY 
ACT GAT GAT ATG ATT GCT GCC TAC 



T ATAGWT F 
ACT GCC ACT GCT GGA TGG ACA TTT 

870 
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TAT GAG TGC GAC ATT CCT ATT 
650 

H T V S L L R 
CAT ACA GTT TCT TTA TTA CGT 

670 

A Y T M S L G 
GCT TAT ACT ATG TCT TTA GGT 

680 

N N T I A I P 
AAT AAC ACC ATT GCT ATA CCT 

700 

T E V M P V S 
ACA GAA GTA ATG CCT GTT TCT 

710 

N M Y I C G D 
AAT ATG TAC ATC TGC GGA GAT 

730 

L Q Y G S F C 
CTC CAA TAT GGT AGC TTT TGC 

740 

G I A A E Q D 
GGT ATT GCT GCT GAA CAG GAT 

760 

Q V K Q M Y K 
CAA GTC AAA CAA ATG TAC AAA 

770 

■ G F N F S Q I 
GGT TTT AAT TTT TCA CAA ATA 

790 

K R S F I E D 
AAG AGG TCT TTT ATT GAG GAC 

800 

A D A G F M K 
GCT GAT GCT GGC TTC ATG AAG 

820 

I N A R D L I 
ATT AAT GCT AGG GAC CTC ATT 

830 

T V L P P L L 
ACA GTG TTG CCA CCT CTG CTC 

850 

T A A L V S G 
ACT GCT GCT CTA GTT AGT GGT 

860 

G A G A A L Q 
GGT GCT GGC GCT GCT CTT CAA 

880 
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IPFAMQMAYRFNGIG 
ATA CCT TTT GCT ATG CAA ATG GCA TAT AGG TTC AAT GGC ATT GGA 

890 

VT QNVLYENQKQIAN 
GTT ACC CAA AAT GTT CTC TAT GAG AAC CAA AAA CAA ATC GCC AAC 

900 910 

QFNKAISQIQE SLTT 
CAA TTT AAC AAG GCG ATT AGT CAA ATT CAA GAA TCA CTT ACA ACA 

920 

TSTALGKLQDVVNQN 
ACA TCA ACT GCA TTG GGC AAG CTG CAA GAC GTT GTT AAC CAG AAT 

930 940 
A-QALNTLVKQLSSNF 
GCT CAA GCA TTA AAC ACA CTT GTT AAA CAA CTT AGC TCT AAT TTT 

950 

GAIS SVLNDIL SRLD 
GGT GCA ATT TCA AGT GTG CTA AAT GAT ATC CTT TCG CGA CTT GAT 

970 



960 



KVEAEVQIDRLITGR 
AAA GTC GAG GCG GAG GTA CAA ATT GAC AGG TTA ATT ACA GGC AGA 

980 

L Q S L Q T Y VT Q Q L I R A 
CTT CAA AGC CTT CAA ACC TAT GTA ACA CAA CAA CTA ATC AGG GCT 

. ^ 1000 
AEIRASANLAA. TKMS 
GCT GAA ATC AGG GCT TCT GCT AAT CTT GCT GCT ACT AAA ATG TCT 

1010 

ECVLGQSKRVD FCGK 
GAG TGT GTT CTT GGA CAA TCA AAA AGA GTT GAC TTT TGT GGA AAG 

1020 1030 
GYHLMSFPQAAPHG V 
GGC TAG CAC CTT ATG TCC TTC CCA CAA GCA GCC CCG CAT GGT GTT 

1040 

VFLHVTYVPSQERNF 
GTC TTC CTA CAT GTC ACG TAT GTG CCA TCC CAG GAG AGG AAC TTC 

1050 1060 
TTA PAICHEGKAYFP 
ACC ACA GCG CCA GCA ATT TGT CAT GAA GGC AAA GCA TAC TTC CCT 

1070 

REGVFVFNGTSWFIT 
CGT GAA GGT GTT TTT GTG TTT AAT GGC ACT TCT TGG TTT ATT ACA 

1080 1090 
QRNFFSPQIITTDNT 
CAG AGG AAC TTC TTT TCT CCA CAA ATA ATT ACT ACA GAC AAT ACA 

1100 

FVSGN CDVVIGIINN 
TTT GTC TCA GGA AAT TGT GAT GTC GTT ATT GGC ATC ATT AAC AAC 
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1110 112 
TVYDPLQPEL D SFKE 
ACA GTT TAT GAT CCT CTG CAA CCT GAG CTT GAG TCA TTC AAA GAA 

1130 

ELDKYPKNHTSP DVD 
GAG CTG GAG AAG TAG TTC AAA AAT CAT ACA TCA CCA GAT GTT GAT 



1150 
Q 



1140 

FGDISGINASVVN 
TTT GGC GAC ATT TCA GGC ATT AAC GCT TCT GTC GTC AAC ATT CAA 

1160 

KEIDRLNEVAKNLNE 
AAA GAA ATT GAC CGC CTC AAT GAG GTC GCT AAA AAT TTA AAT GAA 

1170 118 
SLIDLQELGKYEQYI 
TCA CTC ATT GAC CTT CAA GAA TTG GGA AAA TAT GAG CAA TAT ATT 

1183 

K W P OC SEQ ID NOs 9801 

AAA TGG CCT TAA TGAGTCGAC SEQ ID NO: 9802 



Translated Mol . Weight 



131315.20- 
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FIGURE 68 
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FIGURE 7i 
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FIGURE 73 
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FIGURE 76 







Endo H : - + 


+ 


gpi6o ^ '^^y^ 




1 2 


3 4 



FIGURE 77 



FIGURE 78 
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FIGURE 79 
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FIGURE 84 
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FIGURE 92 



FIGURE 93 
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FIGURE 103 
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FIGURE 109 

SI S1-S2 
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FIGURE 11i 




FIGURE i12 
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FIGURE 113 

5'3' Frame 1 

PKDMTYVDSSL-WS-ITKSMVTLICLSPMKLFWFVRGLALM-RAVMQLEML^ 
SS-DFLQVLT--LYRLVmjTLKITQNSPELMHKPPPVSSLNILYHSCIKACPGM-C^^ 
YKCSVIH-KDCQTESCSSFGRMALSLHQ-STLSRLDLKERWCVTNVQLAFLLHQILMPA 
GIILWVLTMSITHL-LMFSSGGFTGNLSE-P-PTLPGTWKCTCGLWML - - 



5'3< Frame 2 

QRT-PT-THLYDGFQNELPSQWLP-YVYHPRRSYSSRSCVDWL-CRGLSCN-RCCGY-PT 
SPARIFYRC-LSSCTDWLC-H-K-HKIHQS-CTNLHQ-AV-TSYTTHV-RLALECSAY-D 
STl^Q-YTERIWQSRWPLGAWL-AYINEVLCQDWT-KNVLSV-QTCNLLFYFIRYLCL 
LESFCGF-LCL-PIYD-CSAVGALRVTFQSNHDQHCQVHGNAHVG-L-C 

5'3' Frame 3 

KGHDLRRLISmGFKMNYQWGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLP 
LQLGFSTGVNLVAVPTGYVDTENNTKFTRWAQTSTSEQFKHLIPLMYKGLPV^^^ 
VQMLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYAC 
WNHSVGFDYVYNPFMIDVQQWGLYG-PFRVTMTNIARY^ . 

3"5' Frame 1 

-HHN-PTCAFPCTWQCWSWLL-KVTRKAPTAEHQS-MGYRHSQNPQNDSSRHKYIM 
KLHVCHTDNTFFQVQS-QSTSLM-AQSHAPKGRTRLCLTILSVYH-AFVLS-YALHSRAS 
LYT-W-DV-TAHWWRFVH-LW-ILCYFQCQHNQSVQLLS-HL-KILAGEVG-YPQHL-L 
HDSPLHQSQSTHERDE-LLRG— TY-GNH-LGNSF-NPS-R-VYVGHVLW 

3'5' Frame 2 

SITTSPHVHFHVPGWGHGYSERLPWPPLLNINHKWVIDIVKTHRMIPAGISI--SRKA 
SCTFWQTTRSFRSNLDKVLH-CKLKAMRPKDEHDSV-QSFQCITEHLYYLNTHYIPGQA 
FIHEWYKMFKLLTGGGLCINSGEFCVIFSVNITSRYSY-VNTCRKS-LER-VSTHSISSC 
MTALYIKANPRTNVTNSFFAGDKHIRVTIDLVIHFETHHRDEST-VMSF 

3'5' Frame 3 

ASQLAHMCISMYLAMLVMVTLKGYP-SPHC-TSIINGL-T-SKPTE-FQQA-VSDEVEKQ 
VARLSHRQHVLSGPILTKYFIDVSSKPCAQRTNTTLSDNPFSVSLSICTILIRTTFQGKP 
LYMSGIRCLNCSLVEVCALTLVNFVLFSVST-PVGTATKLTPVENPSWRGRLVPTASLVA 
-QPSTSKPIHART-RIASSRVINILG-PLTW-FILKPIIEMSLRRSCPL 
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FIGURE 114 

5'3' Frame 1 

YI^LISMMGFKMNYQWGYPNMFITREEAIRHVRAWIGFDVEGCHATR^^ 
FSTGVmiVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLI 

5 « 3 i Frame 2 

TVDSSL-WSK-ITKSMVTLICLSPAKKLFWFWGLAIM-RAWQL 
FLQVLT- -LYRLVMLTLKITQNSPELMQNLHQVTSLNILY 

5'3» Frame 3 

P-THLYDGFQNELPSQWLP-YVYHPRRSYSSRSCVDWL-CRGLSCN-RCCGY-PTSPARI 
FYRC-LSSCTDWLC-H-K-HRIHQS-CKTSTR-PV-TSYT 

3'5' Frame 1 

GIRCLNWSPGGGFALTLWSVLFSVST-PVGTATKLTPVENPSlimGRLVPTASLVA-QPS:. 
TSKPIHART-RIASSRVINILG-PLTW-FILKPIIEMSLR 

3'5' Frame 2 

V-DV-TGHLVEVLH-LW-ILCYFQCQHNQSVQLLS-HL-KILAGEVG-YPQHL-LHDSPL 
HQSQSTHERDE-LLRG— TY-GNH-LGNSF-NPS-R-VYG 

3'5' Frame 3 

YKMFKLVTWWRFCINSGEFCVIFSVNITSRYSY-VNTCRKS-LER-VSTHSISSCMTAL^ 
IKANPRTNVTNSFFAGDKHIRVTIDLVIHFETHHRDESTV 
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FIGURE 115 
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FIGURE 115 (contd,) 
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FIGURE 115 (contd,) 
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FIGURE 115 (contd.) 
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FIGURE 118 
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FIGURE lis (contd,) 
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FIGURE 119 

tagtcaaaacccacagaatgattccagcaggcataagtatctgatgaagtagaaaagcaa 

-VSD EVEKQ 
gttgcacgtTTGtcacacagacaacacgttctttcaggtccaatcTTGacaaagtacttc 

V A R _h_ S HRQHVLSGPI _L_ T K Y F 
attgatgtaagctcaaagccatgcgcccaaaggacgaacacgactctgtctgacaatcct 

IDVSSKPCAQRTNTTLSDNP 
ttcagtgtatcactgagcatttgtactatcttaatacgcactacattccagggcaagcct 

FSVSLSICTILIRTTFQGKP 
ttatacATGagtggtataagatgtttaaactgctcactggtggaggtttgtgcattaact 

L Y _M_ SGIRCLNCSLVE VCALT 
Ctggtgaattttgtgttattttcagtgtcaacataa SEQ ID NO: 10080 

LVNFVLFSVST - SEQ ID NO: 10027 



FIGURE 120 

FIGURE 120A 

PRHTQRT - PTVDS SL- WVSK- ITKSMVTLI CLS PAKKLFVTFVRGLALM-RAVMQ. 

LEMLWVLTYLSS-DFLQVLT--LYRLVMLTLKITQNSPEmQNLHQVTSLNILYH 

SCIKACPGM-CVLR-YKCSVIH-KDCQTESCSSFGRMALSLHQ-STLSRLDLKER 

WCVTNVQLAFLLHQILMPAGIILWVLTMSITHL-LMFSSGALRVTFRVTMTNIA 

RYMEMHMWLVVMLS-LDV-QSMSALLSALIGLLNTLL-Em-GLILLAEK^^ 

-SLHCLLISFQFFMT-EIQRLSSVCLRLK-NGSSTMLSHWTKLTK-RNSSILML 

YITINSLMVFVCFGIVTLIVTQPMQLCVGLTQESCQT-TYQAVMVWCM- 1 SMHS 

TLQLSIKVHLLI-SNCLSFTILIVLVSLMANK-CRILIMFHSNLLRVLHDAI-W 

LFADTMQMSTDSTM[HII--FLLDLAYGFT]SINLILITCGIHLPGYRV 

FIGURE 120B 

LGIPKGHDLP-THLYDGFQNELPSQWLP-YVYHPRRSYSSRSCVDWL-CRGLSCN 
-RCCGY-PTSPARIFYRC-LSSCTDWLC-H-K-HRIHQS-CKTSTR-PV-TSYTT 
HV-RLALECSAY-DSTNAQ-YTERIVRQSRVRPLGAWL-AYINEVLCQDWT-KNV 
LSV-QTCNLLFYFIRYLCLLESFCGF-LCL-PIYD-CSAVGLYG-PSE-P-PTLP 
GTWKCTCG-L-CYHD-MFSSP-VLC-AR-LVC-IPYYRR-TEG-FCLQKSTTHGC 
EVCIAC—VSSSS-HRKSKGYQVCASG-SRMEVLRCSAM— QSLQNRGTLLFLCY 
TSR- IH-WCLFVLEL-R- SLPSQCNCV-V-HKSLVKLELTRL-WW-FVCE-ACIP 
HSSFR-KCIY-FKAIAFLLLF— SL-VSWQTSSVGY-LCSTQICYVYYTMQFRWC 
CLQTPCK-VPTVLGCI-YDDFCWI-PMDLQTI-YL-PVEYIYQVTEF 
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FIGURE 120C 

-AYPKDMTYRRLISIVMGFKMNYQWGYPNMFITR^ 

TNLPLQLGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMY^ 

RIKIVQMLSDTLKGLSDRWFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSD 

TYACWNHSVGFDYV^PFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAV 

HECFVKRVDWSVEYPIIGDELRWSACRKVQHMVVKSALLADKFPVLHDIGNPKAIKCVP 

QAEVEWKFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVGRFD 

TRVLSNim,PGCDGGSLYV]SIKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWS 

IDYVPLKSATCITRCNLGGAVCRHHANEYRQYLDAYNMMISAGFSLWIYK^^ 

FTRLQSL 

FIGURE 120D 

-TL-PGKCIPQVISIKLFVNP-AKSSRNHHIICIQVLSVLICMVSANSTT-IASCNTRSR 

FEWNIINIRHYLFAMRLTRTIRIWERQLL-ISKCTFIESWSVECMLIHIQTTTITAW-V 

QV-QDSCVKPTHNCIGWVTINWIPKQTNTISEFIVMYSIRIEEFLYWSFWT^ 

LPFYFSLRHTLDSLWISYVMKNWKLISKQCRLH3S[HVLYFSASRINPQFISYNRWNRPIN 

ALNKALMDC-TSSHDSITTSHMCISMYLAMLVMVTLKVTRKAP^ 

MIPAGISI— SRKASCTFVTQTTRSFRSNLDKVLH-CKLKAMRPKDEHDSV-QSFQCITE 

HLYYLNTHYIPGQAFIHEWYKMFKLVTWWRFCINSGEFCVIFSWITSRYSY-VOT 

-LER-VSTHSISSCMTALYIKANPRTNVTNSFFAGDKHIRVTIDLVIHFETHHRDESTVG 
HVLWVCL 

FIGURE 120E 

KLCNLWVFHRL-VSNCL-IHRLNPAEIIILYASKYCRYSFAWCLQTAPPKLHRVIHVAD 

LSGT-SISDTTCLP-DSQGLSE--KKGNCFKLVNALLSKAGVWNACLFTYKLPPSQPGKP 

KFDKTLVSNLHTIALAG-RSTLQFQmCQTPSVNLS-CIA-E-KSSSIL-ALSLHG-AS-N- 

FHSTSA-GTHLIAFGFPMS-RTGNLSASNADFTTMCCTFLQAELTLSSSPIIGYSTDQST 

RLTKHSWTAKHLVMIASQLATCAFPCTWQCWSWLL-RLPVKPHC-TSIINGL-T-SKPTE 

-FQQA-VSDEVEKQVARLSHRQHVLSGPILTKYFIDVSSKPCAQRTNTTLSDNPFSVSLS 

ICTILIRTTFQGKPLYMSGIRCLNWSPGGGFALTLVNSVLFSVST-PVGTATKLTPVENP 

SWRGRLVPTASLVA-QPSTSKPIHART-RIASSRVINILG-PLTW-FILKPIIEMSLR-V 
MSFGYA- 

FIGURE 120F 

NSVTW-MYSTGYKYQIVCKSIG-IQQKSSYYMHPSTVGTHLHGVCKQHHLNCIV-YT-QI 

-VEHNQYPTLLVCHETHKDYQNSKRKAIALN—MHFYRKLECGMHAYSHTNYHHHSLVSS 

SLTRLLCQTYTQLHWLGmDQRYNSKTNKHHQ-IYRDV-HKNRRVPLFCKLCHYMAEHRR^ 

SILLQPEAHT— PLDFLCHEELETYQQAMQTSQPCWLFCKQN-PSVHLL—GIQQTNQR 

A-QSTHGLLNI-S— HHN-PHVHFHVPGNVGHGYSEGYP-SPTAEHQS-MGYRHSQNPQN 

DSSRHKYLMK-KSKLHVCHTDNTFFQVQS-QSTSLM-AQSHAPKGRTRLCLTILSVYH-A 

FVLS-YALHSRASLYT-W-DV-TGHLVEVLH-LW-ILCYFQCQHNQSVQLLS-HL-KIL 

AGEVG-YPQHL-LHDSPLHQSQSTHERDE-LLRG~TY-GNH-LGNSF-NPS-R-VYGRS 
CPLGMPR 
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FIGURE 121 



10 20 30 40 50 60 

I I 11 I I 

SEQ ID NO: 10033 TACCGTAGACTCATCTCTATGATGGGTTTCAAAA 

SEQ ID NO: 10084 CCTAGGCATACCCAAAGGACATGACCTACCGTAGACTCATCTCTATGATGGGTTTCAAAA 
Consensus TACCGTAGACTCATCTCTATGATGGGTTTCAAAA 
Prim. cons. CCTAGGCATACCCAAAGGACATGACCTACCGTAGACTCATCTCTATGATGGGTTTCAAAA 

70 80 90 100 110 ' 120 

I I I i i I 

SEQ ID NO: 10033 TGAATTACCAAGTCAATGGTTACCCTAATATGTTTATCACCCGCGAAGAAGCTATTCGTC 
SEQ ID NO: 10084 TGAATTACCAAGTCAATGGTTACCCTAATATGTTTATCACCCGCGAAGAAGCTATTCGTC 
Cons ensus TGAATTACCAAGTCAATGGTTACCCTAATATGTTTATCACCCGCGAAGAAGCTATTCGTC 
Prim. cons. TGAATTACCAAGTCAATGGTTACCCTAATATGTTTATCACCCGCGAAGAAGCTATTCGTC 

130 140 150 160 170 180 

SEQ ID NO.10033 ACGTTCGTGCGTGGAOTGGCTTTCATCTAGAGGGCTCTcl 

SEQ ID NO: 10084 ACGTTCGT6CGTGGATTGGCTTTGATGTAGAGGGCTGTCATGCAACTAGAGATGCTGTGG 
Consensus ACGTTCGTGCGTGGATTGGCTTTGATGTAGAGGGCTGTCATGCAACTAGAGATGCTGTGG. 
Prim. cons. . ACGTTCGTGCGTGGATTGGCTTTGATGTAGAGGGCTGTCATGCAACTAGAGATGCTGTG6 



190 200 210 220 230 240 

I III I I 

SEQ ID NO: 10033 GTACTAACCTACCTCTCCAGCTAGGATTTTCTACAGGTGTTAACTTAGTAGCTGTACCGA 
SEQ ID NO: 10084 GTACTAACCTACCTCTCCAGCTAGGATTTTCTACAGGTGTTAACTTAGTAGCTGTACCGA 
Cons ensus GTACTAACCTACCTCTCCAGCTAG6ATTTTCTACAGGTGTTAACTTA6TAGCTGTACC6A 
Prim . cons . GTACTAACCTACCTCTCCAGCTAG6ATTTTCTACAGGTGTTAACTTAGTAGCTGTACC6A 

250 260 270 280 290 300 

I 11 I I I 

SEQ ID NO: 10033 CTGGTTATGTTGACACTGAAAATAACACAGAATTCACCAGAGTTAATGCAAAACCTCCAC 
SEQ ID NO: 10084 CTGGTTATGTTGACACTGAAAATAACACAGAATTCACCAGAGTTAATGCAAAACCTCCAC 
Consensus CTGGTTATGTTGACACTGAAAATAACACAGAATTCACCAGAGTTAATGCAAAACCTCCAC 
Prim. cons. CTGGTTATGTTGACACTGAAAATAACACAGAATTCACCAGA6TTAATGCAAAACCTCCAC 



310 320 330 340 350 360 

I I I I I I 

SEQ ID NO: 10033 CAGGTGACCAGTTTAAACATCTTATACC 

SEQ ID NO: 10084 CAGGTGACCAGTTTAAACATCTTATACCACTCATGTATAAAGGCTTGCCCTGGAATGTAG 
Consensus CAGGTGACCAGTTTAAACATCTTATACC 

Prim. cons. CAGGTGACCAGTTTAAACATCTTATACCACTCATGTATAAAGGCTTGCCCTGGAATGTAG 



etc. 
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FIGURE 122 

5 • 3 • Frame 1 

cctaggcatacccaaaggacatgacctaccgtagactcatctctatgatgggtttcaaaa 

PRHTQRT-PTVDSSL-WVSK 
tgaattaccaagtcaatggttaccctaatatgtttatcacccgcgaagaagctattcgtc 

-ITKSMV TLICLSPAKKLFV 
acgttcgtgcgtggattggctttgatgtagagggctgtcatgcaactagagatgctgtgg 

TFVRGLALM-RAVMQLEML W 
gtactaacctacctctccagctaggattttctacaggtgttaacttagtagctgtaccga 

VLTYL S S - DFLQVLT - -lyr 
ctggttatgttgacactgaaaataacacagaattcaccagagttaatgcaaaacctccac 

LVMLTLKITQNSPELMQNLH 
caggtgaccagtttaaacatcttataccactcatgtataaaggcttgccctggaatgtag 

QV TSLNILYHSCIKACP GM- 
tgcgtattaagatagtacaaatgctcagtgatacactgaaaggattgtcagacagagtcg 

CVLR-YKCSVIH-KDCQTES 
tgttcgtcctttgggcgcatggctttgagcttacatcaatgaagtactttgtcaagattg 

CS S FGRMAL S LHQ - S TLS RL 
gacctgaaagaacgtgttgtctgtgtgacaaacgtgcaacttgcttttctacttcatcag 

DLKERVVCVTNVQLAFLLHQ 
atacttatgcctgctggaatcattctgtgggttttgactatgtctataacccatttatga 

ILMPAGIILWVLTMSITHL. - 
ttgatgttcagcagtggggctttacgggtaaccttcagagtaaccatgaccaacattgcc 

LMF S SGALRVTFRVTMTNIA 
aggtacatggaaatgcacatgtggctagttgtgatgctatcatgactagatgtttagcag 

R YMEMHMWLV VML S - LDV - Q 
tccatgagtgctttgttaagcgcgttgattggtctgttgaataccctattataggagatg- 

SMSALLSALIGLLNTLL-EM 
aactgagggttaattctgcttgcagaaaagtacaacacatggttgtgaagtctgcattgc 

N-GL ILLAEKYN TWL - SLHC 
ttgctgataagtttccagttcttcatgacataggaaatccaaaggctatcaagtgtgtgc 

LLI SFQFFMT-EIQ RLSSVC 
ctcaggctgaagtagaatggaagttctacgatgctcagccatgtagtgacaaagcttaca 

LRLK-NGSSTMLSHVVTKLT 
aaatagaggaactcttctattcttatgctatacatcacgataaattcactgatggtgttt 

K,-RNSS ILMLYITI NSLMVF 
gtttgttttggaattgtaacgttgatcgttacccagccaatgcaattgtgtgtaggtttg 

VCFGIVTL IVTQPMQLCVGL 
acacaagagtcttgtcaaacttgaacttaccaggctgtgatggtggtagtttgtatgtga 

TQESCQT- .TYQAVMVVVCM- 
ataagcatgcattccacactccagctttcgataaaagtgcatttactaatttaaagcaat 

ISMHSTLQLSIKVHLLI-SN 
tgcctttcttttactattctgatagtccttgtgagtctcatggcaaacaagtagtgtcgg 

CLSFTILI VLVSLMANK-CR 
atattgattatgttccactcaaatctgctacgtgtattacacgatgcaatttaggtggtg 

ILIMFHSN LLRVLHDAI - VV 
ctgtttgcagacaccatgcaaatgagtaccgacagtacttggatgcatataatatgatga 

LFADTMQMSTDSTWMHII-- 
tttctgctggatttagcctatggatttacaaacaatttgatacttataacctgtggaata 

FLLDL AYGFTNNLILITCGI 
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catttaccaggttacagagttta 
H L P G Y R V 



5" 3' Frame 2 

cctaggcatacccaaaggacatgacctaccgtagactcatctctatgatgggtttcaaaat 

LGIPKGHDLP-THLYDGFQN 
gaattaccaagtcaatggttaccctaatatgtttatcacccgcgaagaagctattcgtGa 

ELPSQWLP-YVYHPRRSYSS 
cgttcgtgcgtggattggctttgatgtagagggctgtcatgcaactagagatgctgtggg 

R S CVDWL - C RGL S.CN - RC C G 
tactaacctacctctccagctaggattttctacaggtgttaacttagtagctgtaccgac 

Y-PTSPARIFY-RC-LS SCTD 
tggttatgttgacactgaaaataacacagaattcaccagagttaatgcaaaacctccacc 

WLC-H-K-HRIHQ S-CKTST 
aggtgaccagtttaaacatcttataccactcatgtataaaggcttgccctggaatgtagt 

R-PV-TSYTTHV-RLALECS 
gcgtattaagatagtacaaatgctcagtgatacactgaaaggattgtcagacagagtcgt 

AY-DSTNAQ-YTERIVRQSR 
gttcgtcctttgggcgcatggctttgagcttacatcaatgaagtactttgtcaagattgg 

VRP D GAWL-AYI NEVLCQDW 
acctgaaagaacgtgttgtctgtgtgacaaacgtgcaacttgcttttctacttcatcaga 

T-KN VL SV-QTCNLLFYF IR 
tacttatgcctgctggaatcattctgtgggttttgactatgtctataacccatttatgat 

YLCLLE SFCGF-LCL-PIYD 
tgatgttcagcagtggggctttacgggtaaccttcagagtaaccatgaccaacattgcca 

-CS A VGL YG-PSE-P-P TLP 
ggtacatggaaatgcacatgtggctagttgtgatgctatcatgactagatgtttagcagt 

GTW.KCTCG-L -C YHD -MFS S 
ccatgagtgctttgttaagcgcgttgattggtctgttgaataccctattataggagatga 

P-VLC -AR-LVC-IPYYRR- 
actgagggttaattctgcttgcagaaaagtacaacacatggttgtgaagtctgcattgct 

TEG-FCLQKSTTH GCEVC IA 
tgctgataagtttccagttcttcatgacataggaaatccaaaggctatcaagtgtgtgcc 

C--VSSSS-HRKSKGYQVCA 
tcaggctgaagtagaatggaagttctacgatgctcagccatgtagtgacaaagcttacaa 

SG-SRMEVLRCSAM--QSLQ 
aatagaggaactcttctattcttatgctatacatcacgataaattcactgatggtgtttg 

NRGTLLFLCYTS R- IH-WCL 
tttgttttggaattgtaacgttgatcgttacccagccaatgcaattgtgtgtaggtttga 

FVLEL- R-SL PSQCNCV-V- 
cacaagagtcttgtcaaacttgaacttaccaggctgtgatggtggtagtttgtatgtgaa 

HKSLVKLELTRL-WW^FVCE 
taagcatgcattccacactccagctttcgataaaagtgcatttactaatttaaagcaatt 

-ACIPHSSFR-KCIY-FKAI 
gcctttcttttactattctgatagtccttgtgagtctcatggcaaacaagtagtgtcgga 

AFLLLF--SL-VSWQTSS VG 
tattgattatgttccactcaaatctgctacgtgtattacacgatgcaatttaggtggtgc 

Y-LCS TQ ICYVYYTMQFRWC 
tgtttgcagacaccatgcaaatgagtaccgacagtacttggatgcatataatatgatgat 
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CL QT PCK-V P TVLGC I - YDD 
ttctgctggatttagcctatggatttacaaacaatttgatacttataacctgtggaatac 

FCWI -PMDLQTI - YL-PVEY 
atttaccaggttacagagttta 

I Y Q V T E F 



5'3' Frame 3 

cctaggcatacccaaaggacATGacctaccgtagactcatctctatgatgggtttcaaaatg 

-AYPKDMTYRRLISMMGFKM 
aattaccaagtcaatggttaccctaatatgtttatcacccgcgaagaagctattcgtcac 

NYQVNGYPNMFITREEAIRH 
gttcgtgcgtggattggctttgatgtagagggctgtcatgcaactagagatgctgtgggt 

VRAWI G F DVE G C HATRDA VG 
actaacctacctctccagctaggattttctacaggtgttaacttagtagctgtaccgact 

T. NLPLQLGFSTGVNLVAVPT 
ggttatgttgacactgaaaataacacagaattcaccagagttaatgcaaaacctccacca 

GYVDTENNTEFTRVNAKPPP 
ggtgaccagtttaaacatcttataccactcatgtataaaggcttgccctggaatgtagtg 

GDQFKHLIPLM YKGLPWNVV 
cgtattaagatagtacaaatgctcagtgatacactgaaaggattgtcagacagagtcgtg 

RIKIVQML S'DTLKGLSDRVV 
ttcgtcctttgggcgcatggctttgagcttacatcaatgaagtactttgtcaagattgga 

FVLWA HGFELTSMKYFVKIG 
cctgaaagaacgtgttgtctgtgtgacaaacgtgcaacttgcttttctacttcatcagat 

PERTCC LCDK RA TCFSTSSD 
acttatgcctgctggaatcattctgtgggttttgactatgtctataacccatttatgatt 

TYAC WNHSVG-FDYVYNPFMI 
gatgttcagcagtggggctttacgggtaaccttcagagtaaccatgaccaacattgccag 

DVQQWGFTG N LQSNHDQHCQ 
gtacatggaaatgcacatgtggctagttgtgatgctatcatgactagatgtttagcagtc 

VHGNAHVASC DA IMTRCLAV 
catgagtgctttgttaagcgcgttgattggtctgttgaataccctattataggagatgaa 

HECFV KRVDWSV EYP I IGDE 
ctgagggttaattctgcttgcagaaaagtacaacacatggttgtgaagtctgcattgctt 

liRVNSACRKVQHMVVKSAL L 
gctgataagtttccagttcttcatgacataggaaatccaaaggctatcaagtgtgtgcct 

A DKFPVLHDIGNPKAIKCVP 
caggctgaagtagaatggaagttctacgatgcfccagccatgtagtgacaaagcttacaaa 

QAEVEWKF YDAQPC SDKAYK 
atagaggaactcttctattcttatgctatacatcacgataaattcactgatggtgtttgt 

lEELPYSYAIHHDKFTDGVC 
ttgttttggaattgtaacgttgatcgttacccagccaatgcaattgtgtgtaggtttgac 

LFWNCNVDRYPANAi'vCRFD 
acaagagtcttgtcaaacttgaacttaccaggctgtgatggtggtagtttgtatgtgaat 

TRVLSNLNLPGCDGGSLYVN 
aagcatgcattccacactccagctttcgataaaagtgcatttactaatttaaagcaattg 

KHAFHTPAFDKSAFTNLKQL 
cctttcttttactattctgatagtccttgtgagtctcatggcaaacaagtagtgtcggat 

PFFYYSDSPCESHGKQVVSD 
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attgattatgttccactcaaatctgctacgtgtattacacgatgcaatttaggtggtgct 

I DYVPLK SATC I TRCN LGGA 
gtttgcagacaccatgcaaatgagtaccgacagtacttggatgcatataatatgatgatt 

VC RHHANEYRQY L DAYNMM I 
tctgctggatttagcctatggatttacaaacaatttgatacttataacctgtggaataca 

SAGFSLWIY KQFDTYNLWNT 
tttaccaggttacagagttta 

F T R L Q S L 



3*5* Frame 1 

taaactctgtaacctggtaaatgtattccacaggttataagtatcaaattgtttgtaaat 

-TL-PGKCIPQVISIKLFVN 
ccataggctaaatccagcagaaatcatcatattatatgcatccaagtactgtcggtactc 

P-AKSSRNHHIICIQVLSVL 
atttgcatggtgtctgcaaacagcaccacctaaattgcatcgtgtaatacacgtagcaga 

ICM VSANSTT-IASCNTRSR 
tttgagtggaacataatcaatatccgacactacttgtttgccatgagactcacaaggact 

FEWNI INIRHYLFAMRLTRT 
atcagaatagtaaaagaaaggcaattgctttaaattagtaaatgcacttttatcgaaagc 

IRIVKERQLL-I S KCTFIES 
tggagtgtggaatgcatgcttattcacatacaaactaccaccatcacagcctggtaagtt 

WSVECMLIHIQTTTI TAW-V 
caagtttgacaagactcttgtgtcaaacctacacacaattgcattggctgggtaacgatc 

QV - .QD S CVKPT HNC I GWVT I 
aacgttacaattccaaaacaaacaaacaccatcagtgaatttatcgtgatgtatagcata 

NVT.I PK Q TNT I S E F IVMYS I 
agaatagaagagttcctctattttgtaagctttgtcactacatggctgagcatcgtagaa 

RI EEFLYFVSFVTTWLSIVE 
cttccattctacttcagcctgaggcacacacttgatagcctttggatttcctatgtcatg 

liPFYFSLRHTLDSLWISYV M 
aagaactggaaacttatcagcaagcaatgcagacttcacaaccatgtgttgtacttttct 

KNWKLI SKQCRLHNHVLYFS 
gcaagcagaattaaccctcagttcatctcctataatagggtattcaacagaccaatcaac 

ASR.INPQFISYNRVFNRPIN 
gcgcttaacaaagcactcatggactgctaaacatctagtcatgatagcatcacaactagc 

ALNKAL. MDC-TS SHDSITTS 
cacatgtgcatttccatgtacctggcaatgttggtcatggttactctgaaggttacccgt 

HMCIS M YLAMLVMVTL KVTR 
aaagccccactgctgaacatcaatcataaatgggttatagacatagtcaaaacccacaga 

KAPLLNINHKWVIDIVKTHR 
atgattccagcaggcataagtatctgatgaagtagaaaagcaagttgcacgtttgtcaca 

MIPAGISI--SRKASCTFVT 
cagacaacacgttctttcaggtccaatcttgacaaagtacttcattgatgtaagctcaaa 

QTTRSFRSNLDKVLH-CK LK 
gccatgcgcccaaaggacgaacacgactctgtctgacaatcctttcagtgtatcactgag 

AMRP KDEHDSV-Q SFQCITE 
catttgtactatcttaatacgcactacattccagggcaagcctttatacatgagtggtat 

HLYYLNTHYI PGQAF IHEWY 
aagatgtttaaactggtcacctggtggaggttttgcattaactctggtgaattctgtgtt 
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KMFKLVTWWRFCIN SGEFCV 
attttcagtgtcaacataaccagtcggtacagctactaagttaacacctgtagaaaatcc 

IFSVNITSRYSY-VNTCRKS 
tagctggagaggtaggttagtacccacagcatctctagttgcatgacagccctctacatc 

-LER-VSTHSISSCMTALYI 
aaagccaatccacgcacgaacgtgacgaatagcttcttcgcgggtgataaacatattagg 

KANPRTNVTNSFFAGDKHIR 
gtaaccattgacttggtaattcattttgaaacccatcatagagatgagtctacggtaggt 

VTIDLVIHFETHHRDESTV -G 
catgtcctttgggtatgcctagg 

H V L W V C L 



3 '5' Frame 2 

taaactctgtaacctggtaaatgtattccacaggttataagtatcaaattgtttgtaaatc 

KLCN LVNVFHRL-VSNCL - I 
cataggctaaatccagcagaaatcatcatattatatgcatccaagtactgtcggtactca 

HRLNPAEI IILYASKYCRYS 
tttgcatggtgtctgcaaacagcaccacctaaattgcatcgtgtaatacacgtagcagat 

FAWCLQTAPPKLHRVIHVAD 
ttgagtggaacataatcaatatccgacactacttgtttgccatgagactcacaaggacta 

LSG.T-S ISDTTC LP-DSQGL 
tcagaatagtaaaagaaaggcaattgctttaaattagtaaatgcacttttatcgaaagct 

SE - - KK GNCFKLVNALLS K A 
ggagtgtggaatgcatgcttattcacatacaaactaccaccatcacagcctggtaagttc 

GVWNACLFTYKLPPSQPGKF 
aagtttgacaagactcttgtgtcaaacctacacacaattgcattggctgggtaacgatca 

K F D KTLVSNLHTIA L A G - R S 
acgttacaattccaaaacaaacaaacaccatcagtgaatttatcgtgatgtatagcataa 

TLQFQNKQTPSVNLS-CIA- 
gaatagaagagttcctctattttgtaagctttgtcactacatggctgagcatcgtaaaac 

E-KSSSIL-ALSLHG-AS- N 
ttccattctacttcagcctgaggcacacacttgatagcctttggatttcctatgtcatga 

FHSTSA- GTHLIAFGF PMS- 
agaactggaaacttatcagcaagcaatgcagacttcacaaccatgtgttgtacttttctg 

RTGNLSASNADFTTMCCTFL 
caagcagaattaaccctcagttcatctcctataatagggtattcaacagaccaatcaacg 

QAELTLSSSPIIGYST DQST 
cgcttaacaaagcactcatggactgctaaacatctagtcatgatagcatcacaactagcc 

RLTKHSW TAKHLVMIASQLA 
acatgtgcatttccatgtacctggcaatgttggtcatggttactctgaaggttacccgta 

TCAFPCT. WQCWSWLL-RLP V 
aagccccactgctgaacatcaatcataaatgggttatagacatagtcaaaacccacagaa 

KPHC-TSIINGL-T-SKPTE 
tgattccagcaggcataagtatctgatgaagtagaaaagcaagttgcacgtttgtcacac • 

-FQQA-VSDEVEKQVARLSH 
agacaacacgttctttcaggtccaatcttgacaaagtacttcattgatgtaagctcaaag 

RQHVL SGPILTKYFIDVS SK 
ccatgcgcccaaaggacgaacacgactctgtctgacaatcctttcagtgtatcactgagc 

PCAQRTNTTLSDNPFSVSLS 
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atttgtactatcttaatacgcactacattccagggcaagcctttatacatgagtggtata 

ICTILIRTTFQ GKPLYMSGI 
agatgtttaaactggtcacctggtggaggttttgcattaactctggtgaattctgtgtta 

RCLNWS PGGG FALTLVNSVL 
ttttcagtgtcaacataaccagtcggtacagctactaagttaacacctgtagaaaatcct 

FSVST - PVGTATKLTPVENP 
agctggagaggtaggttagtacccacagcatctctagttgcatgacagccctctacatca 

SWRGRLVPTASLV A-QPSTS 
aagccaatccacgcacgaacgtgacgaatagcttcttcgcgggtgataaacatattaggg 

KPIHART-RIASSRVINILG 
taaccattgacttggtaattcattttgaaacccatcatagagatgagtctacggtaggtc 

-PLTW-FILKPIIEMSLR-V 
atgtcctttgggtatgcctagg 

M S F G Y A - 



3* 5' Frame 3 

taaactctgtaacctggtaaatgtattccacaggttataagtatcaaattgtttgtaaatcc 

NSV TW-MYSTGYKYQ IVCKS 
ataggctaaatccagcagaaatcatcatattatatgcatccaagtactgtcggtactcat 

I G - I Q Q K S S YYM H P S TV6 T H 
ttgcatggtgtctgcaaacagcaccacctaaattgcatcgtgtaatacacgtagcagatt 

L HGV CKQHHL NCIV-YT-QI 
tgagtggaacataatcaatatccgacactacttgtttgccatgagactcacaaggactat 

-VEHNQYPTLLVCHETHKDY 
cagaatagtaaaagaaaggcaattgctttaaattagtaaatgcacttttatcgaaagctg 

Q NS KRKAIALN - - MHFYRKL 

gagtgtggaat^catgcttattcacatacaaactaccaccatcacagcctggtaagttca 

ECGMHAYSHTNYHHHSLVSS 
agtttgacaagactcttgtgtcaaacctacacacaattgcattggctgggtaacgatGaa 

SLTRLLCQTYTQ LHWLGNDQ 
cgttacaattccaaaacaaacaaacaccatcagtgaatttatcgtgatgtatagcataag 

R YNSKTNKHHQ - lYRDV-HK 
aatagaagagttcctctattttgtaagctttgtcactacatggctgagcatcgtagaact 

NRRVPLFCKLCHYMAEHRRT 
tccattctacttcagcctgaggcacacacttgatagcctttggatttcctatgtcatgaa 

SILLQPEAHT--PLDFLCHE 
gaactggaaacttatcagcaagcaatgcagacttcacaaccatgtgttgtacttttctgc 

ELETYQQAMQT SQPCVVLFC 
aagcagaattaaccctcagttcatctcctataatagggtattcaacagaccaatcaacgc 

KQN - P S V HL L - - G I Q Q TN Q R 
gcttaacaaagcactcatggactgctaaacatctagtcatgatagcatcacaactagcca 

A-QSTHGLLNI-S--HHN- P 
catgtgcatttccatgtacctggcaatgttggtcatggttactctgaaggttacccgtaa 

HVHFHVPGNVGHGYSEGYP- 
agccccactgctgaacatcaatcataaatgggttatagacatagtcaaaacccacagaat 

SPTAEHQS-MGYRHSQNPQN 
gattccagcaggcataagtatctgatgaagtagaaaagcaagttgcacgtttgtcacaca 

DSSRHKYLMK-KSKLHVCHT 
gacaacacgttctttcaggtccaatcttgacaaagtacttcattgatgtaagctcaaagc 
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DNTFFQVQS -Q STSLM-AQS 
catgcgcccaaaggacgaacacgactctgtctgacaatcctttcagtgtatcactgagca 

HAPKGRTRLCLTILSVYH-A 
tttgtactatcttaatacgcactacattccagggcaagcctttatacatgagtggtataa 

FVLS-YALHSRAS LYT-VV~ 
gatgtttaaactggtcacctggtggaggttttgcattaactctggtgaattctgtgttat 

DV-TGHLVEVLH-LW - ILCY 
tttcagtgtcaacataaccagtcggtacagctactaagttaacacctgtagaaaatccta 

FQCQHNQSVQLLS-HL-KI -L 
gctggagaggtaggttagtacccacagcatctctagttgcatgacagccctctacatcaa 

AGEVG-YPQHL-LHDSPLHQ 
agccaatccacgcacgaacgtgacgaatagcttcttcgcgggtgataaacatattagggt 

SQSTHERDE-LLRG--TY-G 
aaccattgacttggtaattcattttgaaacccatcatagagatgagtctacggtaggtca 

NH-LGNSF-NPS-R-VYGRS 
tgtcctttgggtatgcctagg 

C P L G M P R 
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FIGURE 123 



CCTAGGCATACCCAAAGGACATGACCTACCGTAGACTCATCTCTATGATGGGTTTCAAMTGMmCCMGTCMTC^ 

i N..N i N 

TACCCTAATATGTTTATCACCCGCGAAGAAGCTATTCGTCACGTTCGTGCGTGGAmGCTTTGATGTAGA 

i N N 

TGCAACTAGAGATGCTGTGGGTACTAACCTACCTCTCCAGCTAGGATTTTCTAC^^ 

N 

CTGGmTGTTGACACTGAAAATAACACAGAATTCACCAGAGTTAATGCAAM 

N.... N. 

CmTACCACTCATGTATAAAGGCTTGCCCTGGAATGTAGTGCGTATTAAGATAGTACAAAT^ 

N ...N N 

AGGAmTCAGACAGAGTCGTGTTCGTCCTTTGGGCGCATGGCTTTGAGCmC^ 

N N 

GAGCTGAAAGAACGTGmTCTGTGTGACAAACGTGCAACTTGCTTTTCTACTTC 

N 

CATTCTGTGGGTTTTGACTATGTCTATAACCCATmTGAmATGTTCAGCAGTGGGGCTT^ 

N N N 

TAACCATGACCAACATTGCCAGGTACATGGAAAT6CACATGTGGCTAGTTGTGATGCTATCATGACTA6ATC 

N N N N N i N 

TCCAmGTGCTTTGTTAAGCGCGTTGAmGTCTGTTGAATACCCTAmTA^^ 

...N N 

TGCAGAAAAGTACAACACATGGmTGAAGTCTGCAmCTTGCTGATAAGTTTC^^ 

i N.' 

AAAGGCTATCAAGTGTCTGCCTCAGGCTGAAGTAGAATGGAAGTTCTACGATGCra 

N N N 

AAATAGAGGAACTCTTCTATTCTTATGCTATACATCACGATAAATTCACTGATGGTGm 

N N 

6TTGATCi3TTACCCAGCCAATGCAATT6TGTGTAGGTTT6ACA(^ 

N N 

TGGTGGTAGTTOTATGTGAATAAGCATGCATTCCACACTCCAGCTTTCGATAAAAGTGCATTTACTAATT^ 

N N 

TGCCTTTCTTTTACTATTCTGATAGTCCTTGmGTCTCATGGCAAACAAGTAGTGTCGGATAT^ 

i N 

AAATCTGCTACGTGTAmCACGATGCAATTTAGGTGGTGCTGTTTGCAGAC^^ 

N N N 

GGATGCATATAATATGATGATTTCTGCTGGATTTAGCCTATGGATmCAAACAAm 

..N N. .N N 

CATTTACCAGGTTACA6AGTTTA SEQ ID NO: 10084 
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Sequences : 



74827 I pir I |VFIHJH genome polyprotein lb - murine hepatit. 
14917044 I sp I P29982 | RRPB_CVMJH RNA-directed RNA polymeras . 
26007546|ref |NP_068658.2| ORFlab polyprotein [Murine hep. 



AF208066_2 RNA-directed RNA polyme. 
AF201929_2 RNA-directed RNA polyme. 
RNA-directed RNA polymerase [muri. 



7769342 gb AAF69332.1 
6625761 gb AAF19384.1 
2641128 gb AAB86818.1 
4377413 

133592 |sp|pi6342 |RRPB_CVMA5 RNA-DIRECTED RNA POLYMERASE 
26008080 
15077820 
18033972 



emb|CAA36202.1| open reading frame lb {AA 1-2733. 



ref |np_150073.2| orf lab polyprotein [Bovine cor. 
gb| AAK83365 . 1 1 replicase [bovine coronavirus] 
gb|AAL57305.1 I replicase [bovine coronavirus] 
7769353|gb|AAF69342.l|AF208067_2 RNA-directed RNA polyme 

17529672 ■ ' ' 

25121571 
26008092 
10242469 
14149033 



ref 
ref 
ref 
emb 



gb|AAL40397.l|AF220295_2 RNA polymerase lb [bov. 
■np_740618.1| coronavirus nspll [Murine hepa. 
NP_742140.l| coronavirus nspll [Bovine coro. 
NP_066134.1| ORFlab polyprotein; frameshift. 
CAC39112.1| replicase polyprotein lab [Avia. 
458735|emb|CAA83018.l| potential chimeric protein [Avian. 
133594|sp|P26314|RRPB_IBVB RNA-DIRECTED RNA POLYMERASE (. 
29293454|gb|AAO67706.l| ORFlb polyprotein [Avian infecti . 
25121555 I ref |np_740631.1 I coronavirus nspll [Avian infec. 
9635157|ref |NP_058422.1| replicase . [Transmissible gastro. 
19387582 I ref Inp_598309.1 I . Poll [porcine epidemic diarrhe. 
12175747 I ref |np_073549 .1 I replicase polyprotein lab [Hvim. 
133591 1 sp I P18458 ] RRPB_BEV RNA-directed RNA polymerase (0. 
1513061|dbj |BAA13323.1| cyanoprotein alpha subunit precu. 
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Alignments 



>gi|74827|pir| jVFIHJH genome polyprotein lb - murine hepatitis virus 
(strain JHM) 

Length = 2731 

Score = 638 bits (1645), Expect .= 0.0 

Identities = 287/481 (59%), Positives = 366/481 (76%), Gaps = 5/481 (1%) 

Query: 6 MTYRRLISMMGFKMNYQVNGYPNMPITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPLQ 65 

+TY RLIS+MGFK++ ++GY +FITR+EAI+ VRAW+GFD EG HATRD++GTN PLQ 
Sbjct: 1585 VTYSRLISLMGFKLDLTLDGYCKLFITRDEAIKRVRAWVGFDAEGAHATRDSIGTNFPLQ 1644 

Query; 66 LGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPIMYKGLPWNVVRIKIVQ 125 

LGFSTG++ V TG + F + A+ PPG+QFKHL+PLM +G W+WRI+IVQ 

Sbjct: 1645 LGFSTGIDFVVEATGMFAERDGYWKKAAARAPPGEQFKHLVPLMSRGQKWDVVRIRIVQ 1704 

Query: 126 MLSDTLKGLSDRWFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWN 185 

MLSD L L+D W V WA FELT ++YF K+G E C +C+KRATCF++ + Y CW 
Sbjct: 1705 MLSDHLVDLADSWLVTWAASFELTCLRYFAKVGKEWCSVCNKRATCFNSRTGYYGCWR 1764 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFVK 245 

HS DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CF K 
Sbjct: 1765 HSYSCDYLYNPLIVDIQQWGYTGSLTSNHDPICSVHKGAHVASSDAIMTRCLAVHDCFCK 1824 
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Query: 246 RVDWSVEYPIIGDELRWSACRKVQHMVVKSALLADKFPVLHDIGNPKAIK 305 

V+W++EYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++ 
Sbjct: 1825 SVNWNLEYPIISNEVSVNTSCMJLQRVMFRAJiMLC^ 1882 

Query: 306 KFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPMTAIVCRFDTRVLSN 365 

KFYDA P +++ Y Y H D+F DG+C+FWNCNVD+YPANA+VCRFDTRVLS 

Sbjct: 1883 KFYDASPV---VKSWQFVYKYEAHKDQFLIX3LCMFWNC1WDKYPANAVVCRFDTRVLSK 1939 

Query: 366 L^^JPGCIX3GSLYVNKHAFHTPAFDKSAFT^^KQLPFFYYSDSPCESHGKQWSDIDYVP^. 425 

LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PFFYYSD+PC +DYVPL 
Sbjct: 1940 LNLPGCaJGGSLYVNKHAFHTNPFTRAAFENLKPMPFPYYSDTPCVYl^ 1999 

Query: 426 KSATCITRCNLGGAVCRHHANEYRQYLDAYWmiSAGFSLWIYKQFDTYNLWNTFTRLQS 485 

+SATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFTRLQS 
Sbjct: 2000 RSATCITROILGGAVCLKHAEEYREYLESYNTATTAGFTFWVYKTFDFYNLWNTFTRLQS 2059 

Query: 486 L 486 
L 

Sbjct: 2060 L 2060 



>gi I 14917044 | sp | P29982 | RRPB_CVMJH RNA-directed RNA polymerase (ORFlB) 
gi|758332l|gb|AAA46458.2| open reading frame lb [murine hepatitis virus] 
. Length = 2731 

Score = 637 bits (1644), E3cpect = 0.0 

Identities = 287/481 (59%), Positives =366/481 (76%), Gaps = 5/481 (1%) 

Query: 6 MTYRRLISmGFKMJYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTl^ 65 

+TY RLIS+MGFK++ ++GY +FITR+EAI+ VRAW+GFD EG HATRD++GTO PLQ 
Sbjct: 1585 WYSRLISmGFKLDLTLDGYCKLFITRDEAIKRVRAWVGFDAEGAHATKDSIGlTSIFPLQ 1644 

Query: 66 LGFSTGVl^VAVPTGYVDTENOTEFTRWAKPPPGDQFKHLIPLMYKGLPWNVVRIKIVQ 125 

LGFSTG++ V TG + F + A+ PPG+QFKHL+PLM +G W+WRI+IVQ 

Sbjct: 1645 LGFSTGIDFVVEATGMFAERDGYWKKAAARAPPGEQFKHLVPLMSRGQKWDVVRIRIVQ 1704 

Query: 126 IXLSIXrLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSiyrYACTAtt^ 185 

MLSD L L+D W V WA FELT ++YF K+G E C +C+KRATCF++ + Y CW 
Sbjct: 1705 MLSDHLVDLADSVVLVTWAASFELTCLRYFAKVGKEVVCSVCNKRATCFNSRTGYYGCWR 1764 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFVK 245 

HS DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CF K 
Sbjct: 1765 HSYSCDYLYNPLIVDIQQWGYTGSLTSNHDPICSVHKGAHVASSDAIMTRCLAVHDCFCK 1824 

Query: 246 RVDWSVEYPIIGDELRWSACRKVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQAEVEW 305 

V+W++EYPII +E+ VM++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++ 
Sbjct: 1825 SVM(m.EYPIISNEVSWTSCRLLQRYMFRAAMLCNRYDVCYDIGNPKGLACVKG--YpF 1882 

Query: 306 KFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLSN 365 

KFYDA P +++ Y Y H D+F DG+C+FVJNCNVD+YPANA+VCRFDTRVLS 

Sbjct: 1883 KFYDASPV---VKSVKQPVYKYEAHKDQFLDGLCMFVffNCNVDKYPANAWCRFDTRVLS^ 1939 

Query: 366 LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVPL 425 

LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PFFYYSD+PC +DYVPL 
Sbjct: 1940 LNLPGCNGGSLYVNKHAFHTNPFTRAAFENLKPMPFFYYSDTPCVYMEGMESKQVDYVPL 1999 
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Query: 426 KSATCITRCNLGGAVCRHHANEYRQYLDAYNMMISAGFSLWIYKQFnTY^ 485 

+SATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFTRLQS 
Sbjct: 2000 RSATCITRCNLGGAVCLKHAEEYREYLESYlWATTAGFTFWVlrtCTFDFYl^ 2059 

Query: 486 L 486 
L 

Sbjct: 2060 L 2060 



>gi|26007546|ref |NP_068668.2l ORFlab polyprotein [Murine hepatitis virus] 
Length = 7178 

Score = 637 bits (1644), Expect =0.0 

Identities = 286/481 (59%), Positives = 364/481 (75%), Gaps = 5/481 (1%) 

Query: 6 MTYRMjISMMGFKMlSryQWGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGT^ 65 

+TY RLIS+MGFK++ ++GY +FITR+EAI+ VRAW+GFD EG HA RD++GTN PLQ 
Sbjct: 6032 VTYSRLISLMGFKLDLTLDGYCKLFITRDEAIKRVRAWVGFDAEGAHAIRDSIGTNFPLQ 6091 

Query: 66 IK3FSTGVNLVAVPTGYVDTENOTEFTRVNAKPPPGDQFKHLIPLMYKGLPW1^^ 125 

LGFSTG++ V TG + F + A+ PPG+QFKHLIPIiM +G W+WRI+IVQ 

Sbjct: 6092 LGFSTGIDFVVEATGMFAERIX3YVFKKAAARAPPGEQFKHLIPIJ1SRGQKM5VVRIRIVQ 6151 

Query: 126 MLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCPKRATCFSTSSnTYACWN 185 

MLSD L L+D W V WA FELT ++YF K+G E C +C KRATCF++ + Y CW 
Sbjct: 6152 MLSDHLADLADSVVLVTWAASFELTCLRYFAKVGREWCSVCTKRATCFNSRTGYYGCWR 6211 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFVK 245 

HS DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CF K 
Sbjct: 6212 HSYSCDYLYNPLIVDIQQWGYTGSLTSNHDPICSVHKGAHVASSDAIMTRCLAVHDCFCK 6271 

Query: 246 RVDWSVEYPIIGDELRWSACRKVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQAEVEW 305 

V+W++EYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++ 
Sbjct: 6272 SVNWNLEYPIISNEVSVlSrrSCRLLQRVMFRAAMLCNRYDVCYDIGNPKGLAC^ — YDP 6329 

Query: 306 KFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLSN 365 

KFYDA P +++ Y Y H D+F DG+C+FWNCNVD+YPANA+VCRFDTRVL+ 

Sbjct: 6330 KFYDASPV---VKSVKQFVYKYEAHKDQFLDGLCMFWNC3^KYPANAWCRFOT 6386 

Query: 366 LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVPL 425 

LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PFFYYSD+PC +DYVPL 
Sbjct: 6387 Ll^PGOJGGSLYVNKHAFHTSPFTRAAFENLKPMPFFYYSDTPCVYMEGMESKQVDYVPL 6446 

Query: 426 KSATCITRCNLGGAVCRHHANEYRQYLDAYNMMISAGFSLWIYKQFDTYNLWNTFa^ 485 

+SATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFTRLQS 
Sbjct: 6447 RSATCITRCNLGGAVCLKHAEEYREYLESYNTATTAGFTFWVYKTFDFYNLWNTFTRLQS 6506 

Query: 486 L 486 
L 

Sbjct: 6507 L 6507 

>gi|7769342|gb|AAF69332.l|AF208066_2 RNA-directed RNA polymerase (murine 
hepatitis virus] 

Length = 2732 
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Score = 637 bits (1644), Expect =0.0 

Identities = 287/481 (59%), Positives = 366/481 (76%), Gaps = 5/481 (1%) 

Query: 6 MTYRiy^ISmGFKMNYQWGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGT^ 65 

+TY RLIS+MGFK++ ++GY +FITR+EAIR VRAW+GFD EG HATRD++GTN FLO 
Sbjct: 1586 WYSRLISmGPKLDLTLDGYCKLFITRDEAIRRVRAWVGFDAEGAHATRDSlGTNFPLQ 1645 

Query: 66 LGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNWRIKIVQ 125 

LGPSTG++ V TG + F + A+ PPG+QFKHL+PLM +G W+WRl+IVQ 

SbDCt: 1646 LGFSTGIDFVVEATGMFAERDGYWKKAVARAPPGEQFKHLVPLMSRGQKWDVVRIRIVQ 1705 

Query: 126 MLSDTLKGLSDRVWVLWAHGFELTSMKYFVKIGPERTCCLOJKRATCFSTSSiyrY 185 

MLSD L L+D WV WA FELT ++YF K+G E C +C+KRATCF++ + Y CW 
SbDCt: 1706 MLSDHLVDIADSVVLVTWAASFELTCLRYFAKVGKEWCSVCNKRATCFNSRTGYYGCWR 1765 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFVK 245 

HS DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CF K 
Sb3Ct: 1766 HSYSCDYLYNPLIVDIQQWGYTGSLTSNHDLICSVHKGAHVASSDAIMTRCLAVHDCFCK 1825 

Query : 2 46 RVDWSVEYPIIGDELRVNSACRKVQHMWKSALLADKFPVLHDIGNPKAIKCVPQAEVEW 305 

V+WS+EYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++ 
SbDCt: 1826 SVNWSLEYPIISNEVSVNTSCRLLQRVMFRAAMLCNRYDVCYDIGNPKGLACVKG-:-YDF 1883 

Query: 306 KFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLSN 365 • 

KFYDA P +++ . Y Y H D+F DG+C+FWNCNVD+YPANA+VCRFDTRVL+ 

SbDCt: 1884 KFYDASPV---VKSVKQFVYKYEAHKDQFLDGLCMFWNCNVDKYPANAVVCRFDTRVLNK 1940- 

Query: 366 LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVPL 425 

LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PPFYYSD+PC • +DYVPL 

Sbjct: 1941.LNLPGCNGGSLYVNKHAFHTSPFTRAAFENLKPMPFFYYSDTPCVYMEGMESKQVDYVPL 2000 

Query: 426 . KSATCITRCNLGGAVCRHHANEYRQYLDAYNMMISAGFSLWIYKQFDTYNLWNTFTRLQS 485 
+SATCITRCNLGGAVC HA +YR+YL++YN +A6F+ W+YK FD YNLWNTFTRLQS 

SbDCt: 2001 RSATCITRCNLGGAVCLKHAEDYREYLESYNTATTAGFTFWVYKTFDFYNLWNTFTRLQS 2060 

Query: 486 L 486 
L 

Sbjct: 2061 L 2061 

>gi|662576l|gb|AAFl9384.l|AF201929_2 RNA-directed RNA polymerase [laurine 
hepatitis virus strain 2] 

gi|7739595|gb|AAF68920.l|AF207902_2 RNA-directed RNA polymerase [murine 
hepatitis virus strain ML-11] -r i 

Length = 2733 

Score = 637 bits (1643), Expect =0.0 

Identities = 287/481 (59%), Positives = 366/481 (76%), Gaps = 5/481 (1%) 

Query: 6 MTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPLQ 65 
ot-. . +TY RLIS+MGFK++ ++GY +FITR+EAIR VRAW+GFD EG HATRD++GTN PLQ 

SbDCt: 1587 VTYSRLISLMGFKLDLTLDGYCKLFITRDEAIRRVRAWVGFDAEGAHATRDSIGTNFPLQ 1646 

Query: 66 LGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNWRIKIVQ 125 
LGFSTG++ V TG + F + A+ PPG+QFKHL+PLM +G W+WRI+IVQ 
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Sbjct: 1647 LGFSTGIDFVVEATGMFAERDGYVFKKAVARM»PGEQFKHLWLMSRGQKWD^ 1706 

Query: 126 MLSOTLKGLSDRVVFVLWAHGFELTSMKYFWIGPERTCCLCDKRATCFSTSSiyiYAC^ 185 

MLSD L L+D W V WA FELT ++YF K+G E C +C+KRATCF++ + Y CW 
Sbjct: 1707 MiSDHLVDl^SVVLVTWAASFELTCLRYFiyCVGKEWCSVCNKI^TCFNSRTGYYGCWR 1766 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVH^ 245 

HS DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CP K 
Sbjct: 1767 HSYSCDYLYNPLIVDIQQWGYTGSLTSmmLICSVHKGAHVASSDAIMTRCLAVHDCFCK 1826 

Query: 246 RVDWSVEYPIIGDELRVNSACRKVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQAEVEW 305 

V+WS+EYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++ 
Sbjct: 1827 SVlMSLEYPIISNEVSVNTSCRLLQRWRAAMLCmYDVCYDIGNPKGLAC^ 1884 

Query: 306 KPYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCMVDRYPANAIVCRFDTRVLSN 365 

KFYDA P +++ Y Y H D+F DG+C+FWNCNVD+YPANA+VCRFDTRVIj+ 

Sbjct: 1885 KFYDASPV VKSVKQFVYKYEAHKDQFIJDGLCMFWttiaJVDKYPANAW 1941 

Query: 366 LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVPL 425 

LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PFFYYSD+PC +DYVPL 
Sbjct: 1942 l^PGCNGGSLYVNKHAFHTSPFTRAAFENLKPMPFFYYSDTPCVYMEGMESKQVDYVPL 2001 

Query: 42,6 KSATCITRCNLGGAVCRHHANEYRQYLDAYNmiSAGFSLWIYKQFDTYlSnjWNTFTRLQS 485 

+SATCITRCNLGGAVC HA +YR+YL++YN +AGF+ W+YK FD YNLWNTFTRLQS 
Sbjct: 2002 RSATCITRCNLG6AVCLKHAEDYREYLESYOTATTAGFTFWVYKTFDPYNLWNTFTRLQS 2061 

Query: 486 L 486 
L 

Sbjct: 2062 L 2062 



>gi 1 2641128 |gb|AAB86818.1 1 RNA-directed RNA polymerase [murine hepatitis 
virus] 

Length = 2733 
Score = 635 bits (1637), Expect = 0.0 

Identities = 286/481 (59%), Positives = 364/481 (75%), Gaps = 5/481 (1%) 

Query: 6 MTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPLQ 65 

+TY RLIS+MGFK++ ++GY +FITR+EAI+ VRAW+GFD EG HA RD4-+GTN PLQ 
Sbjct: 1587 VTYSRLISLMGFKLDLTLDGYCKLFITRDEAIKRVRAWVGFDAEGAHAIRDSIGTNFPLQ 1646 

Query: 66 LGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNWRIKIVQ 125 

LGFSTG++ V TG + F + A+ PPG+QFKHLIPLM +G W+WRI+IVQ 

Sbjct: 1647 LGFSTGIDFVVEATGMFAERDGYWKKAAARAPPGEQFKHLIPLMSRGQKWDVVRIRIVQ 1706 

Query: 126 MLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWN 185 

MLSD L L+D W V WA FELT ++YF K+G E C +C KRATCF++ + Y CW 
Sbjct: 1707 MLSDHLADLADSWLVTWAASFELTCLRYFAKVGREWCSVCTKRATCFNSRTGYYGCWR 1766 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFVK 245 

HS DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CF K 
Sbjct: 1767 HSYSCDYLYNPLIVDIQQWGYTGSLTSNHDPICSVHKGAHVASSDAIMTRCLAVHDCFCK 1826 

Query: 246 RVDWSVEYPIIGDELRVNSACRKVQHIWVKSALLADKFPVLHDIGNPKAIKCVPQAEVEW 305 
V+W++EYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++ 
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Sbjct: 1827 SVNWNLEYPIISNEVSVNTSCRLLQRWRAAMLCNRYDVCYDIGNPKGIA 1884 

Query: 306 KFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVL^^ 365 

KFYDA P +++ y y H D+F DG+C+FWNCNVD+YPANA+VCRFDTRVL+ 

Sbjct: 1885 KFYDASPV---VKSVKQFVyKYEAHKDQFLDGLCMFWNC]^ICYPANAWCRFDTRV^ 1941 

Query: 366 IJJLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVPL 425 

LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PFFYYSD+PC +DYVPL 
Sbjct: 1942 imPGOJGGSLYVNKHAFHTSPFTRAAFENLKPMPFFYYSDTPCVYMEGMESKQVD^ 2001 

Query: 426 KSATCITRCNLGGAVCRHHANEYRQYLDAYlSIllMISAGFSLWIYKQFDTYNLl^ 485 

+SATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YMLWNTFTRLQS 
Sbjct: 2002 RSATCITRClSn^GAVCLKHAEEYREYLESYlWATTAGFTFWVYKTFDFYmiWOTFTRLQ 2061 

Query: 486 L 486 
L 

Sbjct: 2062 L 2062 



>gi|4377413|einb|CAA36202.l| open reading frame lb (AA 1-2733) [Murine 
hepatitis virus] 

Length = 2733 

Score = 634 bits (163.6), Escpect = 0.0 

'Identities = 286/481 (59%), Positives = 364/481 (75%), Gaps = 57481 (1%) 

Query: 6 MTYRRLISMMGFKm^QVNGYPMyiFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPLQ 65 

+TY RLIS+MGFK++ ++GY +FITR+EAI+ VRAW+GFD EG HA RD++GTN PLQ 
Sbjct: 1587 VTYSRLISLMGFKLDLTLDGYCKLFITRDEAIKRVRAWGFDAEGAHAIRDSIGTNFPLQ 1646 

Query: 66 LGFSTGVlSTLVAVPTGYVDTEmTEFTRWAKPPPGDQFKHLIPLMYKGLPWNVVRI 125 

.LGFSTG++ V TG + f + A+ PPG+QFKHLIPLM +G W+WRI+IVQ 

Sbjct: 1647 LGFSTGIDFVVEATGMFAERDGYVFKKAAARAPPGEQFKHLIPLMSRGQKWDVVRIR^^ 1706 

Query: 126 MLSDTLKGLSDRVVFVLWAHGPELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWN 185 

MLSD L L+D W V WA FELT ++YF K+G E C +C KRATCF++ + Y CW 
Sbjct: 1707 MLSDHLVDLADSVVLVTWAASFELTCLRYFAKVGREWCSVCTKRATCFNSRTGYYGCWR 1766 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFVK 245 

HS DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CF K 
Sbjct: 1767 HSYSCDYLYNPLIVDIQQWGYTGSLTSNHDPICSVHKGAHVASSDAIMTRCLAVHDCFCK 1826 

Query: 246 RVDWSVEYPIIGDELRVNSACRKVQHMWKSALLADKFPVLHDIGNPKAIKCVPQAEVEW 305 

V+W++EYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++ 
Sbjct: 1827 SVNVittJLEYPIISNEVSVNTSCRLLQRVMFRAAMLCNRYDVCYDIGOT^ 1884 

Query: 306 KPYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLSN 365 

KFYDA P +++ Y Y H D+F DG+C+FV3NCNVD+YPANA+VCRFDTRVL+ 

Sbjct: 1885 KFYDASPV---VKSVKQFVYKYEAHKDQFLDGLC]^VMCNVDKYPANAWCRFDTRVLNK 1941 

Query: 366 LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVPL 425 
LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PFFYYSD+PC +DYVPL 
■ Sbjct: 1942 LNLPGCNGGSLYVNKHAFHTSPFTRAAFENLKPMPFFYYSDTPCVYMEGMESKQVDYVPL 2001 

Query: 426 KSATCITRCNLG6AVCRHHANEYRQYLDAYNMMISAGFSLWIYKQFDTYNLWNTFTRLQS 485 
+SATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFTRLQS 



BNSDOCID: <WO___^0040923eQA2J_> 



wo 2004/092360 



16V19S 



PCT/US2004/011710 



Sbjct: 2002 RSATCITRCNLGGAVCLKHAEEYREYLESYKTATTAGFTFWYKTFDFYNLliJNTFTR^ 2061 
Query: 486 L 486 
Sbjct: 2062 L 2062 



>gi|l33592|sp|Pl6342|RRPB_CVMA5 RNA-DIRECTED RNA POLYMERASE (ORFIB) 

gi|93916|pir| |S15760 genome polyprotein - murine hepatitis virus (strain 
A59) - 
Length =2733 

Score = 634 bits (1636), Expect = 0.0 

Identities = 286/481 (59%), Positives = 364/481 (75%), Gaps = 5/481 (1%) 

Query: 6 ■M'YRRLISmGFKMNYQWGYPNMFITREEAIRHVIUiWIGFDVEGCHAT^ 65 

+TY RLIS+MGFK++ ++GY +FITR+EAI+ VRAW+GFD EG HA RD++GTN PLQ 
Sbjct: 1587 WYSRLISLMGFKLDLTLDGYCKLFITRDEAIKRVRAWVGFDAEGAHAIRDSIGTNFPLQ 1646 

Query: 66 LGFSTGVNLVAVPTGYVDTEMJTEFTRWAKPPPGDQFKHLIPLMYKGLPWIWVRIKIVQ 125 

LGFSTG++ V TG + F + A+ PPG+QFKHLIPLM +G W+WRI+IVQ 

Sbjct: 1647 LGFSTGIDFVVEATGMFAERIX3YWKKAAARAPPGEQFKHLIPLMSRGQKWDVVRIRIVQ 1706 

Query: 126 MLSDTLKGLSDRVVFVLWAHGFELTSmYFVKIGPERTCCLCDKRATCFSTSSDITACWN 185 

MLSD L L+D W V WA FELT .++YF K+G E C +C KRATCF++ + Y CW 
Sbjct: 1707 MLSDHLVDLADSWLVTWAASFELTCLRYFAKVGREWCSVCTKRATCFNSRTGYYGCWR 1766 

Query: 186 HSV6FDYVYNPFMIDVQQWGFTGOT.QSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFVK 245 

HS DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CF K 
Sbjct: 1767 HSYSCDYLYNPLIVDIQQWGYTGSLTSNHDPICSVHKGAHVASSDAIMTRCLAVHDCFCK 1826 

Query: 246 RVDWSVEYPIIGDELRWSACRKVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQAEVEW 305 

V+W++EYPII +E+ VN++CR +Q ++ ++A+L +++ V +DI6NPK + CV ++ 
Sbjct: 1827 SVNWNLEYPIISNEVSVNTSCRLLQRVMFRAAMiCNRYDVC^ 1884 

Query: 306 KFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFVmCNVDRYPAimVCRFDT^^ 365 

KFYDA P +++ Y Y H D+F DG+C+FWNCNVD+YPANA+VCRFDTRVL+ 

Sbjct: 1885 KFYDASPV VKSVKQFVYKYEAEnODQFLDGLCMFWNClSrinDKYPA^ 1941 

Query: 366 LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVPL 425 

LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PFFYYSD+PC +DYVPL 
Sbjct: 1942 LNLPGCNGGSLYVNKHAFHTSPFTRAAFENLKPMPFFYYSDTPCVYMEGMESKQVDYVPL 2001 

Query: 426 KSATCITROKn^AVCRHHANEYRQYLDAYNMMISAGPSLWIYKQPDTYNLWOT 485 

+SATCITRCNLG6AVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFTRLQS 
Sbjct: 2002 RSATCITRCNLG6AVCLKHAEEYREYLESYOTATTAGFTFWVYKTFDFYNLWNTFTRLQS 2061 

Query: 486 L 486 
L 

Sbjct: 2062 L 2062 



>gi|26008080|ref |np_150073.2| orflab polyprotein [Bovine coronavirus] 
Length =7094 

Score = 633 bits (1633), Expect = e-180 



BNSDOCID: <WO___200409236QA2J_> 



wo 2004/092360 PCT/US2004/011710 

163/193 

Identities = 284/481 (59%), Positives = 367/481 (76%), Gaps = 5/481 (1%) 

Query: 6 MTYRRLISmGFKMfYQWGYPmFITREEAIRHVRAWIGFDVEGCHATMDAVGTNLPLO 65 
ov.- co.o "^^^ RLIS+MGFK++ ++GY +FIT+EEA++ VRAW+GFD EG HATRD++GTN PLO 
Sbjct: 5948 WYSRLISLMGFKLDVTUXSYCKLFITOEAVKRVRAWVGFDAEGAHATRDSIGTNFPLQ 6007 

Query: 66 LGFSTGVNLVAVPTGYVDTEl^EFTRVNi^PPPGDQFKHLIPW^ 125 

u LGFSTG++ V TG + F + AK PPG+QFKHLIPLM +G W+WR +IVO 

SbDCt: 6008 LGFSTGIDFVVEATGLFiUDIUX3YSFKKAVAKM>PGEQFKHLlPIMRGQRWDVW^ 6067 

Query: 126 Ml-SDTLKGLSDRVVFVLWMIGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACW^ 185 
o^.- . .... ^ ^ LSD W V WA FELT ++YF K+G E +C +C KRAT +++ + Y CW 
SbjCt: 6068 MFADHLIDLSDCVVLVTWAANFELTCIJIYFAKVGREISCNVCTKRATAYNSRTG^ 6127 

Query: 186 HSVGFDYVYNPPMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIIOTRCLAVHECFVK 245 
o,- . ..no ^^''^^ ++D+QQWG+ G+L SNHD +C VH AHVAS nAIMTRCLAV++CF 

SbDCt: 6128 HSVTCDYLYNPLIVDIQQWGYIGSLSSNHDLYCSVHKGAHVASSDAim'RCLAVYDCFa!^ 6187 

Query: 246 RVDWSVEYPIIGDELRVNSACRKVQHMWKSALLADKFPVLHDIGNPKAIKCVPQAEVEW 305 
o^- . ^.oo ++W+VEYPII +EL +N++CR +Qi +++K+A+L +++ + +DIGNPKAI CV + ++ 
SbDCt: 6188 NINWNVEYPIISNELSINTSCRVLQRVMLKAAMLCmYTLCYDIGNPKAIAW^ 6245 

Query: 306 I^YDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRV^^ 365 
cK-i^^- coAc S™S^ ++ L YS+ H D F DG+C+FWNCNVD+YP NA+VCRFDTRVL+N 

SbDCt: 6246 KFYDAQPI---VKSVKTLLYSFEAHKDSFKIX3IOIFWNCNVDKYP^^ 6302 

Query: 366 IJ^LPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVPL 425 
cv.. LNLPGC+GGSLYVNKHAFHT F ++AF +LK +PFFYYSD+PC +DYVPL 

Sbjct: 6303 U^LPGCNGGSLYVNKHAFHTKPFSRAAFEHLKPMPFFYYSOTPCVYMDG 6362 

Query: 426 KSATCITRCNLGGAVCRHHANEYRQYLDAYN]yMISAGFSLWIYKQFI>ITfNLW^^ 485 

. ' KSATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFT+LOS- 

Sb3Ct: 6363 KSATCITRCNLGGAVCLKHAEEYREYLESYNTATTAGFTFWVYKTFDFYNLWNTFT^^^ 
Query: 486 L 486 
L 

Sbjct: 6423 L 6423 



>gi|l5077820igb|AAK83365.l| replicase [bovine coronavirus] 
Length =7094 

Score = 633 bits (1633), Expect = e-180 

Identities = 284/481 (59%), Positives = 367/481 (76%), Gaps = 5/481 (1%) 

Query: 6 MTYRRLISmGFKMNYQWGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTK^ 
■ cu- ^ ^n.n '^'^ RLIS+MGFK++ ++GY +FIT+EEA++ VRAW+GFD EG HATRD++GTN PLO 
SbDct: 5948 VTYSRLISLMGFKLDVTLDGYCKLFITKEEAVKRVRAWVGFDAEGAHATRDSIGTNFPLQ 6007 

Query: 66 LGFSTGVNLVAVPTGYVDTENNTEFTRWAKPPPGDQFKHLIPLraGLPWl^ 125 
a^' u .n.o ^^^STG++ V TG + F + AK PPG+QFKHLIPLM +G W+WR +IVO 

Sb^ct: 6008 LGFSTGIDFVVEATGLFADRDGYSFKKAVAKAPPGEQPKHLIPLMTRGQRvSZpR^^^ 6067 

Query: 126 MLSDTLKGLSDRWFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWN 185 

. M +D L LSD WV WA FELT ++YF K+G E +C +C KRAT +++ + Y CW 

Sbjct: 6068 MFADHLIDLSDCVVLVTWAANFELTCLRYFAKVGREISCNVCTKRATAYNSRTGYYGCWR 6127 
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186 HSVGFDYVYNPFMIDVQQWGFTGl^QSNHDQHCQWGNAHVASCDAIMTRCLAVHECFVK 245 

HSV DY+YNP ++D+QQWG+ G+L SNHD +C VH AHVAS DAIMTRCLAV++CF 
6128 HSVTCDYLYNPLIVDIQQW6YIGSLSSNHDLYCSVHKGAHVASSDAim^lCLAVYIX:FCN 6187 

246 RVDWSVEYPIIGDEIJlWSACRKVQHMVVKSALLADKFPVLHDIGNPKAIKCn/PQAE 305 

++W+VEYPII +EL +N++CR +Q +++K+A+L +++ + +DICTIPKAI CV + ++ 
6188 NINWNVEYPIISNELSIOTSCRVLQRViynJKAMLCNRYTLCYDIGNPKAIACV-- 6245 

306 KFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLSN 365 

KFYDAQP ++ L YS+ H D F DG+C+FWNCNVD+YP NA+VCRFDTRVL+N 

6246 KFYDAQPI VKSWTLLYSFEAHKDSFKIX3LC34FWNQIVDKYPPNAWCRFDTRV^^ 6302 

366 LNLPGCIXSGSLYVlSrKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVPL 425 

UyJLPGC+GGSLYVNKHAFHT F ++AF +LK +PFFYYSD+PC +DYVPL 
6303 IJ^PGCNGGSLYVNECHAFHTKPFSRAAFEHLKPMPFFYYSDTPCVYmXS]^ 6362 

426 KSATCITRCmGGAVCRHHZ^YRQYLDAYNmiSAGFSLWIYKQPDTYNLT/iOT 485 

KSATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFT+LQS 
6363 KSATCITROSLGGAVCLKHAEEYREYLESYOT'ATTAGFTFWVYKTFDFYNLWNTFTKLQS 6422 

486 L 486 

L . . 
6423 L 6423 



>gi|18033972 |gbiAAL57305.1| replicase [bovine coronavirus] 
Length =7094 

Score = 633 bits (1633), Expect = e-180 

Identities .= 284/481 (59%), Positives = 367/481 (76%), Gaps = 5/481 (1%) 

Query: 6 - MTYRRLISmGFKiyttJYQWGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPLQ 65 

+TY RLIS+MGPK+4 ++GY +FIT+EEA++ VRAW+GFD EG HATRD++GTN PLQ 
Sbjct: 5948 VTYSRLISLMGFKLDVTLDGYCKLFITKEEAVKRVRAWVGFDAEGAHATRDSIGTNFPLQ 6007 

Query: 66 LGFSTGVNLVAVPTGYVDTENlOTEFTRWAKPPPGDQFKHLIPmYKGLPWN^ 125 

LGFSTG++ V TG + F + AK. PPG+QFKHLIPLM +G W+WR +IVQ 

Sbjct: 6008 LGFSTGIDFVVEATGLFADRDGYSFKKAVAKAPPGEQFKHLIPIiMTRGQRWDVVRPRIVQ 6067 

Query: 126 MLSDTLKGLSDRVWVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWN 185 

M +D L LSD W V WA FELT ++YF K+G E +C +C KRAT +++ + Y CW 
Sbjct: 6068 MFADHLIDLSDCWL\mflAAl^FELTCLRYFAKVGREISCNVCTKRATAYNSRTGYYGCWR. 6127 

Query: 186 HSVGFDYVYNPFMIDVQQW6FTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFYK 245 

HSV DY+YNP ++D+QQWG+ G+L SNHD +C VH AHVAS DAIMTRCLAV++CF 
Sbjct: 6128 HSVTCDYLYNPLIVDIQQWGYIGSLSSNHDLYCSVHKGAHVASSDAIMTRCLAVYDCFCN 6187 

Query: 246 RVDWSVEYPIIGDELRVNSACRKVQHMWKSALLADKFPVLHDIGNPKAIKCVPQAEVHflr 305 

++W+VEYPII +EL +N++CR +Q +++K+A+L +++ + +DIGNPKAI CV + ++ 
Sbjct: 6188 NINVJNVEYPIISNELSINTSCRVLQRVMLKAAiyLCMlYTLCYDIGNPKAI^^ 6245 

Query: 306 KFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLSN 365 

KFYDAQP ++ L YS+ H D F DG+C+FWNCNVD+YP NA+VCRFDTRVL+N 

Sbjct: 6246 KFYDAQPI VKSVKTLLYSFEAHKDSFKDGLCMFWNCNVDKYPPNAWCRFDTRVLNN 6302 



Query: 
Sbjct: 
Query: 
Sbjct: 
Query: 
Sbjct: 
Query: 
Sbjct: 
Query: 
Sbjct: 
Query: 
Sbjct: 
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Query: 366 I^JLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVPL 425 

LNLPGC+GGSLYVNKHAFHT F ++AF +LK +PFFYYSD+PC +DYVPL 
SbDCt: 6303 I^JLPGCNGGSLYVNKHAFHTKPFSRAAFEHLKPMPFFYYSOTPC^^ 6362 

Query: 426 KSATCITRCNLGGAVCRHHANEYRQYLDAY]^ISAGFSLWIYKQFDTYM.WNTFTRIX)^ 485 
c.^' . ... KSATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFT+LQS 
Sbjct: 6363 KSATCITRCNLGGAVCLKHAEEYREYLESYNTATTAGFTFWVYKTFDPY]>n^WOT 6422 
Query: 486 L 486 
L 

Sbjct: 6423 L 6423 

hepitlSfvi^uit^'^'^'-'-''^'''^''-' ^^-^^^^^^^^ polymerase [murine 

Length = 2733 

Score = 633 bits (1633), Expect = e-180 

Identities = 285/481 (59%>, Positives =364/481 (75%), Gaps = 5/481 (1%) 

MTYRRLISMMGFKimYQWGYPNMFITREEAIimVRAWIGFDVEGCHATRDAVG 65 
++Y RLIS+MGFK++ ++Gy +FITR+EAI+ VRAW+GFD EG HA RD++GTN PLQ 
VSYSRLISLMGFKLDLTl^GYCKLFITRDEAIKRVRAWVGFDAEGAHAIRDSIGTNFPLQ 1646 

LGPSTGVNLVAVPTGYVDTENOTEFTRVNAKPPPGDQFKHLIPIJy[YK^ 125 

V TG + F + A+ PPG+QFKHLIPLM +G W+WRI+IVq' 

I/5FSTGIDFVVEATGMFAERDGYVFKKAAARAPPGEQFKHLIPLMSRGQKWDVVRIRIVQ 1706 

MLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWN 185 
MLSD L L+D W V WA FELT ++YF K+G E C +e KRATCF++ + Y CW 

MLSDHLVDLADSVVLVTWAASFELTCLRYFAKVGREVVCSVCTKRATCFNSRTGYYGCWR 1766 

HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFVK 245 
SLo^Sr™^ ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CF K 
HSYSCDYLYNPLIVDIQQWGYTGSLTSNHDPICSVHKGAHVASSDAIMTRCLAVHDCFCK 1826 

RVDWSWPIIGDELRWSACaiKVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQAE^ 305 

V+W++EYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++ 
SVNWNLEYPIISNEVSVOTSCmLQRWRAAMLCmYDVCYDIGNPK^ 1884 

KFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLSN 365 
KFYDA P +++ Y Y H D+F DG+C+FWNCNVD+YPANA+VCRFDTRVL+ 



LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PFFYYSD+PC +DYVPL 
Sb3Ct: 1942 LNLPGCNGGSLYVNKHAFHTSPFTRAAFENLKPMPFFYYSDTPCVYMEGIffiS 2001 

KSATCITRCmXSGAVCRHHANEYRQYLDAYNlMISAGFSLWIYKQFDTYNLWNTFT 485 
+SATCITRCNLGGAVC HA EYR+YL++YN . +AGF+ W+YK FD YNLWNTFTRLQS 



Query: 


6 


Sbjct: 


1587 


Query: 


66 


Sbjct: 


1647 


Query: 


126 


Sbjct: 


1707 


Query: 


186 


Sbjct: 


1767 


Query: 


246 


Sbjct: 


1827 


Query: 


306 


Sbjct: 


1885 


Query: 


366 


Sbjct: 


1942 


Query: 


426 


Sbjct: 


2002 


Query: 


486 


Sbjct: 


2062 
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>gi|17529672]gblAAL40397.1|AF220295_^2 PNA polymerase lb [bovine 
coronavirus] 

Length = 2685 

Score = 623 bits (1607), Expect = e-177 

Identities = 282/481 (58%), Positives = 365/481 (75%), Gaps = 5/481 (1%) 

Query: 6 M'YKIU^ISmGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATIUDAVGTK^ 65 

+TY RLIS+MGFK++ ++6Y +PIT+EEA++ VRAW+GFD EG HATRD++GTN -SIQ 
Sbjct: 1574 VTYSRLISmGFKIiDVTLIX3YCKLFITKEEAVKRVRAWGFDAE(M 1633 

Query: 66 LGFSTGWLVAVPTGYVDTENNTEFTRWAKPPPGDQFKHLIPLMYKGLPWNTV^ 125 

LGFSTG++ V TG + F + AK PPG+QFKHLIPLM +G W+WR +IVQ 

Sbjct: 1634 LGFSTGIDFVVEATGLFADRDGYSFKKAVAKAPPGEQFKHLIPIJyiTRGQRWDVVRP 1693 

Query: 126 MjSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWN 185 

M +D L LSD W V WA FELT ++YF K+G E +C + KRAT +++ + Y CW 
Sbjct: 1694 MPADHLIDLSDCVVLVTWAANFELTCLRYPAKVGREISCNVSTKRATAYNSRTGYYGCWR 1753 

Query: 186 .HSVGFDYVYNPFMIDVQQWGFTGlSnjQSiraDQHCQVHGNAHVASm 245 

HSV DY+YNP ++D+QQWG+ G+L SNHD +C VH AHVAS DAIlirRCLAV++CP 
Sbjct: 1754 HSWCDYLYNPLIVDIQQWGYIGSLSSNHDLYCSVHKGJ^ASSDAIMTRCLAVYDCPCN 1813 

Query: 246 RVpWSVEYPIIGDELRWSACRKVQHMSAOCSALLADKFPVLHDIGNPKAIKCW 305 

++W+VEYPII +EL +N++CR +Q +++K+A+L +++ + +DIGNPKAI CV + ++ 
Sbjct: 1814 NINWNVEYPIISISELSINTSCRVLQRVMLKAM^C^ — KDFDF 1871 

Query: 306 KFYDAQPCSDKAYKIEELFYSYAIHHDKFTIXSVCLFWNCNVDRYPANAIIVCRFDTRVLSN 365" 

KFYDAQP ++ L Y + H D F DG+C+FWNCNVD+YP NA+VCRFDTRVL+N 

Sbjct:. 1872 KFYDAQPI VKSVKTLLYFFEAHKDSFKIX3LCMFWNCTJVDKYPPNAW(3^ 1928 

Query: 366 LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVPL 425 

•LNLPGC+GGSLYVNKHAFHT F ++AF +LK +PFFYYSD+PC +DYVPL 
Sbjct: 1929, LNLPGCNGGSLYVNKHAFHTKPFSRAAFEHLKPMPFFYYSDTPCVYMD^ 1988 

Query: 426 KSATCITROILGGAVamHANEYRQYLDAYNmiSAGFSLWIYKQFDTYNLTffl^^ 485 

KSATCITRCNLGGAVC HA EYR+YL+4-YN +AGF+ W+YK FD YNLWNTFT+LQS 
Sbjct: 1989 KSATCITRCNLGGAVCLKHAEEYREYLESYOTATTAGFTFWVYKTFDFYNLWIWFT^^ 2048 

Query: 486 L 486 
L 

Sbjct: 2049 L 2049 



>gi 125121571 |ref|NP_740618.1 I coronavirus nspll [Murine hepatitis virus] 
Length =521 

Score = 622 bits (1603), Expect = e-177 

Identities = 284/479 (59%), Positives = 362/479 (75%), Gaps = 5/479 (1%) 

Query: 6 MTYRRLISmGFKMNYQWGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPLQ 65 

+TY RLIS+MGFK++ ++GY +FITR+EAI+ VRAW+GFD EG HA RD++GTN PLQ 
Sbjct: 48 VTYSRLISLMGFKLDLTLDGYCKLFITRDEAIKRVRAWVGFDAEGAHAIRDSIGTNFPLQ 107 



Query: 66 LGFSTGVNLVAVPTGYVDTENNTEFTRWAKPPPGDQFKHLIPLMYK^^ 125 
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LGFSTG++ V T6 + F + A+ PPG+QFKHLIPLM +G W+WRI+IVQ 

SbDCt: 108 I^FSTGIDFVVEATGMFAERTCYWKKAAARAPPGEQPKHLIPLMSRGQKl^DVVRIRIV^ 167 

Query: 126 MLSDTLKGLSDRVVFVLWZ^GFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYAC^ 185 

MLSD L L+D W V WA FELT ++YF K+G E C +C KRATCF++ + Y CW 
SbDCt: 168 MLSDHLADIJmSVVLVTWAASFELTCLRYFAKVGREWCSVCTKRATCFNSRTGYyGCS^ 227 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFVK 245 

HS DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CF K 
SbDCt: 228 HSYSCDYLYNPLIVDIQQWGYTGSLTSNfTOPICSVHKGAHVASSDAIMTRCLAVHDCFCK 287 

Query: 246 RVIMSWPIIGDEmVNSACRKVQHlWVKSALIJUJKFPVLHDIGNPKAIKCVPQAEVE^ 305 

V+W++EYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++ 
Sbjct: 288 SVNWNLEYPIISNEVSWSCRIiQRWRAAMLCNRYDVCYDIGOT^ 345 

Query: 306 KFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLSN 365 

KFYDA P +++ Y Y H D+F DG+C+FWNCNVD+YPANA+VCRFDTRVL+ 

SbDCt: 346 KPYDASPV---WSVKQFVYKYEAHKDQFLDGLCMFWNCI^KYPM[AWCRF]^ 402 

Query: 366 LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVPL 425 

LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PFFYYSD+PC +DYVPL 
Sbjct: 403 LNLPGCNGGSLYVNKHAFHTSPFTRAAFENLKPMPFFYYSDTPCVYM^ 462. 

Query: 426 KSATCITRCNLGGAVCRHHANEYRQYUlAYNMMISAGFSLWIYKQFDTYmiW^ 484 

• _ +SATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFTRLQ 

SbDCt: 463 RSATCITRCajLGGAVCLKHAEEYREYLESYNTATTAGFTFWVYKTFDFYNLWNTFTRLQ 521 

>gi|26008092|ref {NP_742140.1| coronavirus nspll [Bovine coronavirus] 
Length = 521 

Score = 617 bits (1590), Expect = e-175 

Identities = 282/479 (58%), Positives = 365/479 (76%), Gaps = 5/479 (1%) 

Query: 6 MTYRRLISMMGFKMNYQWGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGO^ 65 

+TY RLIS+MGFK++ ++GY +FIT+EEA++ VRAW4GFD EG HATRD++GTN PLQ 
SbDCt: 48 VTYSRLISLMGFKLDVTLDGYCKLFITKEEAVKRVRAWVGFDAEGAHATRDSIGW 107 

Query: 66 LGFSTGVI^VAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPW^^ 125 
^1-. ^ LGFSTG++ V TG + F + AK PPG+QFKHLIPLM +G W+WR +IVQ 

SbDCt: 108 LGFSTGIDFWEATGLFADRDGYSFKKAVAKAPPGEQFKHLIPLMTRGQRWDWRPRIVQ 167 

Query: 126 M,SDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWN 185 

M +D L LSD W V WA FELT ++YF K+G E +C +C KRAT +++ + Y CW 
SbDCt: 168 MFADHLIDLSDCVVLVTWAANFELTCLRYFAKVGREISCWCTKRATAYNSRTGYYGCWR 227 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFVK 245 

HSV DY+YNP ++D+QQWG+ G+L SNHD +C VH AHVAS DAIMTRCLAV++CF 
SbDCt: 228 HSVTCDYLYNPLIVDIQQWGYIGSLSSNHDLYCSVHKGAHVASSDAIMTRCLAVYDCFCN 287 

Query: 246 RVDWSVEYPIIGDELRVNSACRKVQHMWKSALLADKFPVLHDIGNPKAIKCVPQAEVEW 305 

++W+VEYPII +EL +N++CR +Q +++K+A+L +++ + +DIGNPKAI CV + ++ 
SbDCt: 288 NINWNVEYPIISNELSINTSCRVLQRVMLKAMILCNRYTLCYDIGNPKAIACV- 345 

Query: 306 KFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFOTRVLSN 365 
KFYDAQP ++ L YS+ H D F DG+C+FWNCNVD+YP NA+VCRPDTRVL+N 
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Sbjct: 346 KFYDAQPI VKSVKTLLYSFEAHKDSFKDGLCMFWNCNVDKYPPNAWCRFDTRVLN^ 402 

Query: 366 LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVPL 425 

LNLPGC+GGSLYVNKHAFHT F ++AF +LK +PFFYYSD+PC +DYVPL 
Sbjct: 403 Ll^PGCNGGSLYVNKHAFHTKPFSIU^EHLKPMPFFYYSDTPCVYlvmGMDAKQW 462 

Query: 426 KSATCITROSn^GAVCRHHAl^YRQYLDAyNMMISAGFSLWIYKQFDTYmiWOT 484 

KSATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFT+LQ 
Sbjct: 463 KSATCITROJLGGAVCLKHAEEyREYLESYOTATTAGFTFimmCTFDFYmiT^ 521 



>gi|l0242469|ref |np_066134.1| ORFlab polyprotein; frameshift product [Avian 
infectious bronchitis 
virus] 
Length =6629 

Score = 575 bits (1482), Expect = e-163 

Identities = 262/482 (54%), Positives = 344/482 (71%), Gaps = 5/482 (1%) 

Query: 5 DMTYRRLISmGFKWQWGYPIMFITREEAIRHVRAWIGFDV^ 

++TY+ LIS++GFKM+ V G NMFITR+EAIR+VR W+GFDVE HA +GTNLP 
Sbjct: 5515 E±TYKHLISLI/3FKMSVNVEGCaiI04PITRDEAIRNVRGWVGFDVE^ 5574 

Query: 65 QLGFST6VNLVAVPTGYVOTE^I1^EFTRVNAKPPPGDQFK^ 124 

Q+GFST6 + V P G VDT F VN+K. PPG+QF HL L PW+V+R +IV 

Sbjct: 5575 QVGFSTGADFVVTPEGLVDTSIGl^EPVNSKAPPGEQFiraLRVLFKSAKPV^^ 5634 

Query: 125 QMLSDTLKGLSDRVVFVLWAHGFELTSMKYFWIGPERTCCLCDKRATCFSTSSDTYACW 184 

QML+D L +SD WFV W HG ELT+++YFVKIG E+ C C RAT F++ + YACW 
Sbjct: 5635 QMLM)NL<ajVSIXrVVr\m*CHGLELTTLRYFVKIGKEQVCS-CGSRATTFNSH^ 5693 

Query: 185 iraSVGFDYVYNPFMIDVQQWGFTGNLQSNTOQHCQVHGNAHVASCDAIMTRCLAVHECFV 244 

H +GFD+VYNP ++D+QQW6++GNLQ NHD HC VHG+AHVAS DAIMTRCLA++ F 
Sbjct: 5694 KHCLGFDFVYNPLLVDIQQWGYSGNLQFNHDLHamiGHAHVASVDAIMTRCLAINNAF^ 5753 

Query: 245 KRVIMSVEYPIIGDEIiRVNSACTUCVQHMVVKSALIJUDKFPVLHDIGNPKAIKCV^ 304 

+ V+W + YP I +E VNS+Cai +Q M + + A K V++DIGNPK IKCV + +V 
Sbjct: 5754 QDVNVTOLTYPHIANEDEVNSSCRYLQRMYLNACVDALKN^^ 5813 

Query: 305 WKFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFVmOJVDRYPMJAIVCRFDTRV^ 364 

++FYD P + E Y Y H DKF DG+C+FWNCNVD YP N++VCR+DTR LS 
Sbjct: 5814 FRFYDKNPIVRNVKQFE YDYNQHKDKFAIXSLailFWNajVlXryPDNSIjVCRYDTRNLS 5870 

Query: 365 NLNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVP 424 

NLPGC+GGSLYVNKHAF+TP FD+ +F NLK +PFF+Y SPCE+ V+ D V 

Sbjct: 5871 VFKn^PGCMGGSLYVNKHAFYTPKFDRISFRNLKAMPFFFYDSSPCETIQVDGVAQ-DL^ 5929 

Query: 425 LKSATCITROQLGGAVCRHHANEYRQYLDAYNMMISAGFSLWIYKQFDTYNL 484 

L + CIT+CN+GGAVC+ HA Y +++ +YN ++AGF+ W+ + + YNLW +F+ LQ 
Sbjct! 5930 LATKDCITKCmGGAVCKKHAQMyAEFWSYNAAWAGFTFWVTNiaL.NP 5989 

Query: 485 SL 486 
S+ 

Sbjct: 5990 SI 5991 
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>gi 1 14149033 I emb I CAC39112.il replicase polyprotein lab [Avian infectious 
bronchitis virus (strain 
Beaudette CK) ] 
Length = 6629 

Score = 575 bits (1482), Expect = e-163 

Identities = 262/482 (54%), Positives = 344/482 (71%), Gaps = 5/482 (1%) 

Query: 5 DMTYRRLISMMGFKMNYQWGYPWITREEAIRHViyVWIGFDVEGCH^^ 64 

++TY+ LIS++GFKM+ V G NMFITR+EAIR+VR W+GFDVE HA +GTin:iP 
Sbjct: 5515 EITYKHLISIi6FKMSVNVEGCHNMFITRDEAIR]WRGWVGFD^ 5574 

Query: 65 QI^FSTGVNLVAVPTGYVOTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWN^ 124 

Q+GFSTG + V P G VDT F VN+K PPG+QF HL L PW+V+R +IV 

SbDCt: 5575 QVGFSTGADFVVTPEGLVDTSIGNNFEPVNSKAPPGEQFNHLRVLFKSAKPWHVIRPRIV 5634 

Query: 125 QMLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACW 184 

QML+D L +SD WFV W HG ELT+++YFVKIG E+ C C RAT F++ + YACW 
SbDCt: 5635 QMIJU}NLCWS]X:WFVTWCHGLELTTLRYFVKIGKEQVCS-CGSRATTFNSHTQAYACW 5693 

Query: 185 NHSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFV 244 

H +GF.D+VYNP ++D+QQWG++GNLQ NHD HC VHG+AHVAS DAIMTRCLA++ F 
SbDCt: 5694 KHCLGFDFVYNPLLVDIQQWGYSGNIKJFNHDLHCNVHGHAHVASVDAIMrRCLAIK^ 5753 

Query: 245 KRVDWSWPIIGDEmWSACRKVQHMVVKSALMDKFPVIJroiGNPK^ 304 

+ YP I +E VNS+CR +Q M + + + A K V++DIGNPK IKCV + +V 
SbDCt: 5754 QDVNWDLTYPHIAmDEVNSSCRYLQRMYLNACVDALKVNVVYDIGNPKGIKCVRRGDVN 5813 

Query: 305 WKFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLS 364 
r^' ^ eo. . P + .E Y Y H DKF DG+C+FWNCNVD YP N++VCR+DTR LS 

Sb]Ct: 5814 FRFYDKNPIVRNVKQFE---YDYNQHKDKFADGLCMFWNCNVDCYPDNSLW^ 5870 

Query: 365 • NLMiPGCIXSGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVP 424 

NLPGC+GGSLYVNKHAF+TP FD+ +F NLK +PFF+Y SPCE+ V+ D V 

Sbjct: 5871 Wl^PGCNGGSLYVNKHAFYTPKFDRISFRNIjy^FFFYDSSPCETIQVDGVAQ-DLVS 5929 

Query: 425 LKSATCITRCNIX^KSAVCRHHANEYRQYLDAYNMMISAGFSLWIYKQFDTYNLWN^^ 484 

^ + CIT+CN+GGAVC+ HA Y +++ +YN ++AGF+ W+ + + YNLW +F+ LQ 
SbDCt: 5930 lATKDCITKCNIGGAVCKKHAQMYAEFWSYNAAWAGFTFWVT^^ 5989 

Query: 485 SL 486 
S+ 

Sbjct: 5990 SI 5991 

>gi|458735|enib|CAA83018.ll potential chimeric protein [Avian infectious 
bronchitis viirus] 

Length = 2155 

Score = 570 bits (1470), Expect = e-161 

Identities = 262/482 (54%), Positives = 344/482 (71%), Gaps = 5/482 (1%) 

Query: 5 DMTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPL 64 

++TY+ LIS++GFKM+ V G NMFITR+EAIR+VR W+GFDVE HA +GTNLP 
SbDCt: 1596 EITYKHLISLLGFKMSVNVEGOINMFITRDEAIRNVRGWVGFDVEATHACGTNIGTNLPF 1655 
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Query: 65 QLGFSTG^VAVPTGYVDTENNTEFTRWi^PPPGDQFKHLIPIMyKGLPWNVVia 124 

Q+GFSTG + V P G VDT F VN+K PPG+QF HL L PW+V+R +IV 

Sbjct: 1656 QVGFSTGADFVVTPEGLVDTSIGNNFEPVNSKAPPGEQFNHLRVLPKSJ^PWHVTRPRIV 1715 

Query: 125 QMLSiyTLKGLSDRVVFVLWJmGFELTSMKYFVKIGPERTCCLCDKRATCPSTSSDTYACW 184 

QML+D L +SD WFV W HG ELT+++YFVKIG E+ C C RAT F++ + YACW 
Sbjct: 1716 QMLADNLO^VSDCVVFWWCHGLELTTLRYFVKIGKEQVCS-CGSRATTFNSHTQAYAC^ 1774 

Query: 185 NHSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFV 244 

H +GFD+VYNP ++D+QQWG++GNLQ NHD HC VHG+AHVAS DAIMTRCLA++- F 
Sbjct: 1775 KHCLGFDFVYNPLLVDIQQWGYSGNLQFNHDLHCNVHGHAHVASV^^ 1834 

Query: 245 KRVDWSVEYPIIGDELRVNSACRKVQHMVVKSALl^KFPVLHDIGNPKAIKCVPQAEVE 304 

+ V+W + YP I +E VNS+CR +Q M + + + A K V++DIGNPK IKCV + +V 
Sbjct: 1835 QDVNWDLTYPHIANEDEWSSCRYLQRMYLNACVDALKVNWYDIGNPKGIKC^^ 1894 

Query: 305 WKFYDAQPCSDKAYKIEELFYSYAIHHDKFTIXSVCLFWNa^IVDRYPANAI^^ 364 

++FYD P + E Y Y H DKF DG+C+FWNCNVD YP N++VCR+DTR LS 
Sbjct: 1895 FRFYDKNPIVRNVKQFE YDYWQHKDKFADGLCMFllTOCNVDCYPDNSLVCRYDTRNLS 1951 

Query: 365 NLNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQ^^ 

NLPGC+GGSLYVNKHAF+TP FD+ +F NLK +PFF+Y SPCE+ V+ D V 
Sbjct: 1952 VFNLPGOQGGSLYVNKHAFYTPKFDRISFRNLKAMPFFFYDSSPCETIQVIXSVAQ-DLVS 2010 

Query: 425 LKiSATCITRCNIiGGAVCRHffiiNEYRQYLDAYNMlOSAGFS^ 484 

L + CIT+CN+GGAVC+ HA Y +++ +YN ++AGF+ W+ + + YNLW +F+ LQ 
Sbjct: 2011 lATKIXIITKCNIGGAVCmmQMYAEFVTSYNAAWAGFTPWVTNKLNPYNLV^^ 2070 

Query: 485 SL 486 
S+ 

Sbjct: 2071 SI 2072 



>gi|l33594|spiP26314|RRPB_IBVB RNA-DIRECTED RNA POLYMERASE (ORFIB) 

gi 174826 |pir I |WIHB2 genome polyprotein - avian infectious bronchitis 
virus (strain 

Beaudette) 

gil292953|gb|AAA70234.l| pel protein [Avian infectious bronchitis virus] 
gi|331173|gb|AAA46224.l| ORFlb [Avian infectious bronchitis virus] 
Length = 2652 

Score = 570 bits (1469), Expect = e-161 

Identities = 262/482 (54%), Positives = 344/482 (71%), Gaps = 5/482 (1%) 

Query: 5 DMTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPL 64 

++TY+ LIS++GFKM+ V G NMFITR+EAIR+VR W+GFDVE HA +GTNLP 
Sbjct: 1538 EITYIOILISLLGFKMSVNVEGCHNMFITRDEAIRNVRGWVGFDVEAT^ 1597 

Query: 65 QLGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNVVR 124 

Q+GFSTG + V P G VDT F VN+K PPG+QP HL L PW+V+R +IV 

Sbjct: 1598 QVGFSTGADFWTPEGLVDTSIGNNFEPVNSKAPPGEQFNHLRVLFKSAKPWHVIRPRIV 1657 

Query: 125 QMLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACW 184 

QML+D L +SD WFV W HG ELT+++YFVKIG E+ C C RAT F++ + YACW 
Sbjct: 1658 QMLADNLCWSDCVVFVTWCTGLELTTLRYFViaGKEQVCS-CGSRATOFN 1716 
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Qiiery: 185. JfflSVGFDYVYNPFMIDVQQWGFTGNLQSNHIXJHCQVHGNMIVASC^^ 244 

.n.^ +GFD+VYNP ++D+QQWG++GNLQ NHD HC VHG+AHVAS DAIMTRCLA++ F 
SbDCt: 1717 KHCIXSFDFVYNPLLVDIQQWGYSGNLQFNHDIJiC^IVHGHAHVASVDA^ 1776 

Query: 245 KRVDWSVEYPIIGDELRVNSACRKVQHMVVKSi^IJ^KFPVLHDIGNPKAIKCVPQAEVE 304 
^■u ■ . , ^"^^ + YP I +E VNS+CR +Q M + + + A K V++DIGNPK IKCV + +V 
SbDCt: 1777 QDVl^LTYPHIANEDEWSSCRYLQRimjaACVDALKVNV^ 1836 

Query: 305 WKFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVOIFDTO 364 
ow u ^ ^ Y Y H DKF DG+C+FWNCNVD YP N++VCR+DTR LS 
SbDCt: 1837 FRFYDKNPIVRNVKQFE YDYNQHKDKFADGLCMFWNCNVDCYPDNSLVCRYDTRNLS 1893 

Query: 365 NimPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVP 424 

,o«. NLP6C+GGSLYVNKHAF+TP FD+ +F NLK +PFF+Y SPCE+ V+ D V 

SbDct: 1894 VFNLPGCNGGSLYVNiaiAFYTPKFDRISFRm.KAMPFFFYDSSPCETIQVD6VAQ-DLVS 1952 

Query: 425 LKSATCITRCNLGGAVCRHHANEYRQYLDAYNMMISAGFSLWIYKQFDTYM,!?^ 484 
ov.- ^ .n.o ^"*' CIT+CN+GGAVC+ HA Y +++ +YN ++AGF+ W+ + + YNLW +F+l2 
Sb3Gt: 1953 IATKIX:iTKaaiGGAVCKKHAQM5fAEFVTSYNAAWAGFTFWVTNKIJJPYM^ 2012 
Query: 485 SL 486 
S+ 

Sbjct: 2013 SI 2014 . 

>gi|29293454|gb|AAO67706.l| ORFlb polyprotein [Avian infectious bronchitis 
Length = 2649 
Score = 565 bits (1455), Expect = e-160 

Identities = 261/482 (54%), Positives = 342/482 (70%), Gaps = 8/482 (1%,) 

DMTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPL 64 • 
++TY+ LIS++GFKM+ V G NMFITR+EAIR+VR W+GFDVE HA +GTNLP 

EITYKHLISLLGFKMSVNVEGCHNMFITRDEAIRNVRGWVGFDVEATHACGTNIGTNLPF 1597 

QLGFSTGVNLVAVPTGYVDTENNTEFTRWAKPPPGDQFKHLIPLMYKGLPWN^ 12 4 

Q+GFSTG + V P.G +DT F VN+K PPG+QF HL L PW+V+R +IV 

QVGFSTGADFWTPEGLIDTSIGNNFEPVNSKAPPGEQFNHLRALFKSAKPWHVIRPRIV 1657 

QMLSDTLKGLSDRWFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACW 184 
QML+D L +SD WFV W HG ELT+++YFVKIG E+ C C RAT F++ + YACW 

QMIjmNLCNVSDCWFVTWCHGLELTTLRYFVKIGKEQVCS-CGSRATTFNSHTQAYACW 1716 

NHSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFV 244 

H +G VYNP ++D+QQWG++GNLQ NHD HC VHG+AHVAS DA+MTRCLA++ F 
RHCLG VYNPLLVDIQQWGYSGNLQFmJLHCNVHGHAHVASADAVMTRCLAINNAFC 1773 

KRVDWSVEYPIIGDELRVNSACRKVQHIWVKSALLADKFPVLHDIGNPKAIKCVPQAEVE 304 
K V+W ++YP I +E VNS+CR +Q M + + + A K V++DIGNPK IKCV + +V 
KDVNWELQYPHIANEDEVNSSCRYLQRMYLNACVDALKVNWYDIGNPKGIKCVRRGDVN 1833 

WKFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLS 3 64 
^ ^ Y Y+ H DKF DG+C+FVffNCNVD YP N++VCR+DTR LS 



Query: 


5 


Sbjct: 


1538 


Query: 


65 


Sbjct: 


1598 


Query: 


125 


Sbjct: 


1658 


Query: 


185 


Sbjct: 


1717 


Query: 


245 


Sbjct: 


1774 


Query: 


305 


Sbjct: 


1834 
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Query: 365 NIJJLPGaXMSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQW 424 

NLPGC+GGSLYVNKHAFHTP FD+ +P NLK +PFF+Y SPCE+ V+ D V 

Sbjct: 1891 VFNLPGOJGGSLYVNKHAFHTPKFDRISFROTJCAMPFFFYDSSPCETIQVDGVAQ-^^ 1949 

Query: 425 LKSATCITRCNLGGAVCRHHANEYRQYLDAYNmiSAGFSLWIYKQFDTYNLWNTFTRLQ 484 

L + CIT+CN+GGAVC+ HA Y +++ +YN ++AGF+ W+ F+ YNLW F+ LQ 
Sbjct: 1950 LATKDCITKCNIGGAVCKKHAQMYAEFWSYNAAWAGFTFWVTNN^ 

Query: 485 SL 486 

S+ - - - 

Sbjct: 2010 SI 2011 



>gi|25121555jref jNP_740631.l| coronavirus nspll [Avieui infectious 
bronchitis virus] 

Length =521 

Score = 559 bits (1440), Expect = e-158 

Identities :: 261/480 (54%), Positives = 342/480 (71%), Gaps = 5/480 (1%) 

Query: 5 DMTYIUU^ISmGFKMNYQWGYPNMFITREEAIRHVRAWIGFDVEGC 64 

++TY+ LIS++GFKM+ V 6 NMFITR+EAIR+VR W+GFDVE HA +GTNLP 
Sbjct: 47 EITYKHLISLLGFKMSVNVEGCHIMPITRDEAIRNVRGWGFDVEATHACGT^ 106 

Query: 65 QLGFSTGVl^VAVPTGYVDTENNTEFTRVN^^PPPGDQFKI^ 124 

Q+GFST6 + V P G VDT F VN+K PPG+QF HL L PW+V+R +IV 

Sbjct: 107 QVGPSTGADFVVTPEGLVDTSIGNNFEPVNSKAPPGEQFimLRVLFKSAKPWHVIRPRIV 166 

Query: 125 QMLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACW 184 

QML+D Ii +SD WFV W HG ELT+++YFVKIG E+ C C RAT F++ + YACW 
Sbjct: 167 QMLADMXaOVSDCWFVTWCHGLELTTLRYFVKIGKEQVCS-CGSRAOT 225 

Query: 185 NHSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFV 244= 

H +GFD+VYNP ++D+QQWG++GNLQ NHD HC VHG+AHVAS DAIMTRCLA++ F 
Sbjct: 226 KHCLGFDFVYNPLLVDIQQWGYSGNLQFNHDLHCNVHGHAHVASVDAIMTRCLAINNAFC 285 

Query: 245 KRVDWSVEYPIIGDELRWSACRKVQHMVVKSAIJ^MJKFPVIJroiGNPKAIKCVPQA^ 304 

+ V+W + YP I +E VNS+CR +Q M + + + A K V++DIGNPK IKCV + +V 
Sbjct: 286 QDVNWDLTYPHIANEDEWSSCRYLQRMYLNACVDALKVNV^^ 345 

Query: 305 WKFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLS 364. 

++FYD P + E Y Y H DKF DG+C+FWNCNVD YP N++VCR+DTR LS 
Sbjct: 346 FRFYDKNPIVRNVKQFE YDYNQHKDiai"ADGLCMFWNCNVDCYPDNSLVCRYDTRNLS 402 

Query: 365 NLNLPGCDGGSLYVNKHAFHTPAFDKSAPTNLKQLPFFYYSDSPCESHGKQWSDIDYVP 424 

NLPGC+GGSLYVNKHAF+TP FD+ +F NLK +PFF+Y SPCE+ V+ D V 

Sbjct: 403 WNLPGOSrGGSLYVlOaJAFYTPKFDRISFRNLKAMPFFFYDSSPCETIQVDGVAQ-DLVS 461 

Query: 425 LKSATCITROJLGGAVCRHHANEYRQYLnAYWMMISAGFSLWIYKQFIZra^ 484 

L + CIT+CN+GGAVC+ HA Y +++ +YN ++AGF+ W+ + + YNLW +F+ LQ 
Sbjct: 462 LATmZIITKCOTGGAVCKKHAQMYAEFVTSYNAAWAGFTFWVTNia^NPYNLm 521 



>gi|9635157 jref |NP_058422.1| replicase [Transmissible gastroenteritis 
virus] 

gi|7801348 jeirib|CAB91143.1| replicase [Treinsmissible gastroenteritis virus] 
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Length = 6685 
Score = 545 bits (1403), Expect = e-153 

Identities = 261/484 (53%). Positives =335/484 (69%), Gaps = 13/484 (2%) 

Query: 4 KDMTYRRLISMMGFKMNYQWGYPmFITI^EAIRHVRAWIGFDVEGCHATRDAVG^ 63 
OK- ^ ^ ^^^"^ ^ +P A+R+VRAW+GFDVEG H D VGTN+P 

SbDCt: 5574 KDVKYANVISYMGFRFEMriPGYHTLFCTraFAMRNVRAW^^ 5533 

Query: 64 LQLGFSTGVNLVAVPTGYVDTENNTEPTRWAKPPPGDQFKHLIPLimCGLPW^ 123 
OK- ^ T^o. ^Q^^^ V G V TE V A+ PPG+QF HLIPLM KG PW++VR +1 

Sb3Ct: 5634 LQI^PSNGVDPWQTEGCVITEKGNSIEVVKARAPPGEQFAHLIPLMRKG^ 5693 

Query: 124 VQM.SDTLKGLSDRVVFVLWAHGFELTSMKYFVKIG-PERTCCLCDKRATCFSTSSDTYA 182 

VQM+ D GLSD ++FVLWA G ELT+M+YFVKIG P++ C C K ATO+q+q va 
Sb3ct: 5694 VQMVCDYFDGLSDILIFVLWAGGLELTTMRYFVKIGRPQK--CECGKSATCYSSSQSV^A 5751 
Query: 183 CWNHSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAH^ 242 
QK-!..^ c;^.o DY+YNP+ ID+QQWG+TG+L NH + C +H N HVAS DAIMTRCLA+H+C 

Sb3Ct: 5752 CFKHALGCDYLYNPYCIDIQQWGYTGSLSMNHHEVCNIHRNEHVASG^ 5811 

Query: 243 FV^^mWSVEYPIIGDEmWSAC^^ 302 

OK- V co.o ^^^WS+ YP I +E ++N A R VQ V+K+AL +HD+GNPK I+C 

SbDCt: 5812 FVKRXmWSIVYPFIDNEEKINKAGRIVQSHVMKAAIJaFNP^^ 5870" 

Query: 303 VEWKFYDAQPCSDKAYKIEEI^SYAIHHDFI-TIXSVCLFWN^ 362 
01-. ^ ^ "^"^ + L Y y +H +G+ LFWfJCNVD YP +IVCRPDTR 

SbDCt: 5871 IPWFC:YDRDPINN---NWCIJDYDYMVHGQ--MN6I^FWNCNVD^^^ 5925 

Query: 363 LSNl^PGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSb 422 
oK-'l coo. S L+L GC+GG+LYVN HAFHTPA+D+ AF LK +PFFYY DS CE V +Y 

Sbjct: 5926 RSKLSLEGCNGGALYVNNHAFHTPAYDRRAFAKLKPMPFFYYDDSNCE- LVDGQPNY 5981 

Query: 423 ^LKSATCITRCNLGGAVCRHHANEYRQYLDAYWmiSAGFSLWIYKQFDT^ 482 
cK- '. cnoo ^^^^ CIT+CN+GGAVC+ HA- YR Y++ YN+ + AGF++W + FDTY LW+ F 
Sb^ct: 5982 VPLKSNVCITKCNIGGAVCKKHAALYRAYVEDYNIFMQAGFTIWCPQNFI^ 6041 
Query: 483 LQSL 486 
++L 

Sbjct: 6042 SKAL 6045 

li^^fo^cnl^vf ^-^ll [porcine epidemic diarrhea virus! 
gxil3752450|gb|AAK38661.l| Poll [porcine epidemic diarrhea vi^^^^^ 

Score = 541 bits (1394). E3qpect = e-152 

Identities = 256/480 (53%), Positives = 334/480 (69%), Gaps = 12/480 (2%) 

Query: 8 YRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHAT^^ 67 
c*^- \ ^ + + + +F TR+ A+R+VR W+GFDVEG H VGTN+PLOLG 

Sb^ct: 5675 YEHVISFMGFRFDINIPNHHTLFCTRDFAimrVRGWLGFDVEGAH^^ 5734 

Query: 68 FSTG\^VAVPTCYVDTE]^EFTRVNM^ 127 

^ ^ ^ P G V TE+ V A+ PPG+QF HL+PL+ +G PW+WR +TVOM 

Sb3ct: 5735 FSNGVDFVVRPEGCVVTESGDYIKPVRARAPPGEQFAHI^PLr^G^^ 5794 
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Query: 128 SDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCXaDKRATCFSTSSDTYACV^ 187 

SD L LSD ++FVLWA 6 ELT+M+YFVKIGP ++C C K ATC++++ TY C+ H+ 
Sbjct: 5795 SDYLANLSDILIFVLWAGGLELTTMRYFVKIGPSKSCD-CGKVATCYNSALHTYCCFKHA 5853 

Query: 188 VGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFVKRV 247 

+G DY+YNP+ ID+QQWG+ G+L NH +HC VH N HVAS DAIMTRCLA+H+CFVK V 
Sbjct: 5854 LGCDYLYNPYCIDIQQWGYKGSLSLiraHEHaWHI^HVASGDAIMTRCIJ^IHDCFV^ 5913 

Query : 248 DWSVEYPIIGDEI^WSACRKVQHMVVKSALIJUDKFPVLHDIGNPKAIKC^ 307 

DWS+ YP IG+E +N + R VQ ++S L ++DIGNPK I+C + +W 

Sbjct: 5914 DWSITYPFIGNEAVIlKSGRIVQSHTMRSVLKLYNPKAIYDIGNPKGIRCyi-VTDAKWFC 5972 

Query: 308 YDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNajVDRYPANAIVORFOTRVLSN^ 367 

+D P + +E Y Y I H +F DG+CLFWNCNVD YP , ++VCRFDTR S LN 
Sbjct: 5973 FDKNPTNSNVKTLE YDY-ITHGQF-DGLCLFWNCNVDMYPEFSWCRFDTRCRSPLN 6027 

Query: 368 LPGCDGGSLYVimiAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQVVSDIDYVPLKS 427 

L GC+GGSLYVN HAFHTPAFDK AF LK +PFF+Y D+ C+ ++ I+YVPI1++ 
Sbjct: 6028 LEGCWGGSLYVNNHAPHTPAFDKE?AFAiaiKEMPFFPYDDTECD KLQDSINYVPLRA 6083 

Query: 428 ATCITRCNLGGAVCRHHANEYRQYIJDAYNMMISAGFSLWIYKQFDTYl^V^ 486 

+ CIT+CN+GGAVC H Y- Y++AYN SAGF++W+ FDTYNLW TF+ LQ L 
Sbjct: 6084 SNCITKCWGGAVCSKHCaMYHSYVNAYOTFTSAGFTIWVPTSFDTYNLWQTFS^ 6143 

>gi|l2175747|ref |np_073549.1| replicase polyprotein lab [Human Goronavirus 
229E] 

gi 1 12082740 |gb|AAG48591.1 1 AF304460_2 replicase polyprotein lab [Hiiman 
coronavirus 229E] : 

Length = 6758 

Score = 535 bits (1379), Expect = e-151 

Identities = 254/478 <53%) , Positives = 329/478 (68%), Gaps = 13/478 (2%) 

Query: 7 TYRRLISMMGFKMNYQWGYPNMFITREEAIRHVRAWIGFDVEG 66 

TY +IS MGF+. + + G ++F TR+ A+RHVR W+G DVEG H T D VGTN+PLQ+ 
Sbjct: 5642 TYEHVISYMGFRFWSMPGSHSLFCTRDFAMRHVRGVKiGMDVEGAHVTGDNVGTNVPLQV 5701 

* Query: 67 GFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNV\na 126 
• GFS GV+ VAPGVT + VA+ PPG+QF H++PL+ KG PW+V+R +IVQM 
Sbjct: 5702 GFSN6VDFVAQPEGCVLTNTGSVVKPVRARAPPGEQFTHIVPLLRKGQPWSVLRKRIVQM 5761 

Query: 127 LSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSIW^ACVJIOT 186 

++D L G SD +VFVLWA G ELT+M+YFVKIG + C C ATC+++ S+ Y C+ H 
Sbjct: 5762 lADFLAGSSDVLVFVLWAGGLELTTMRYFVKIGAVKH-CQCGTVATCYNSVSNDYCCFKH 5820 

Query: 187 SVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFV^ 246 

++G DYVYNP++ID+QQWG+ G+L +NH C VH N HVAS DAIMTRCLAV++CFVK 
Sbjct: 5821 ALGCl>YVYNPYVIDlQQWGYVGSLSTmmiaiVHRNEHVASGDAIMTRCLAVYDCFVK^ 5880 

Query: 247 VDWSVEYPIIGDELRWSACRKVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQAEVEWK 306 

VDWS+ YP+I +E +N R VQ ++++A+ +HDIGNPK I+C + +W 

Sbjct: 5881 VDWSITYPMIANENAINKGGRWQSHIMRAAIKLl^ 5939 



Query: 



307 FYDAQPCSDKAYKIEELPYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVaiFDTRVLSNL 366 
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^. YDP+ +E YYH DG+CLFWNCNVD YP +IVCRFDTR S L 

Sbjct: 5940 CYDKNPINSNVKTLE---YDYMTHGQ--MDGLCLFWNCNVDMYPEFSIVCR^ 5994 

Query: 367 NLPGCDGGSLYVNKHAFHTPAPDKSAFTNLKQLPFFYYSDSPCESHGKQWSD-IDYVPL 425 

NL G +GGSLYVN HAFHTPA+DK A LK PFFYY D CE WD ++YVPL 

Sbjct: 5995 NLEGVNGGSLYVNNHAFHTPAYDKRAMAKLKPAPFFYYDDGSCE WHDQVNYVPL 6049 

- Query: 426 KSATCITRCNLGGAVCIUIHANEYRQyLDAYNmiSAGFSLWIYKQFDTYNL 483 
++ CIT+CN+GGAVC HAN YR Y+++YN+ A6F++W+ FD YNLW TFT + 
SbDCt: 6050 RATNCITKCNIGGAVCSiOIANLYRAYVESYmFTQAGFNIWVPTTro 6107 



133591|sp|Pl8458|RRPB_BEV RNA-directed RKA polymerase (ORFIB) 
94017|piri IS11238 polymerase - Berne virus 
virus^^^^^'^'^^^^^^'^' 2nd polymerase reading frame (AA 1-2291) [Berne 
Length = 2291 



>gi 
gi 
gi 



Score = 50.1 bits (118), Escpect = 8e-05 

Identities = 37/103 (35%), Positives = 54/103 (52%), Gaps = 11/103 (10%) 

Query : 14 0 FVLWAHGFELTSMKYFVKIGPERTC--CLCDKRATCFSTSSDTYACWNHJ5VGF--DyVYN 195 

F+L++ 4L S+K++V+ TCCC+AC +YCN G +N 

Sb3Ct: 1511 PILYSCSNDLKSLKFYVEFD---TCYFCSCGEMAICmRDGN-YKCRNCYGGMLISKLVN 1566 

Query: 196 PFMIDVQQWGFTGNLQSNHDQHC-QVHGNAHVASCDAIMTRCL 237 
■ . +DVQ+ LQ HD C Q HG++H A CaDA+MT+CL 

Sb3Ct: 1567 CKYLDVQKERV— KLQDAHDAICQQPHGDSHEALCDAVMTKCL 1607 

claiatusr^'"^^ '^^"^^^'^' alPl^a subunit precursor (Riptortus 

Length = 693 

Score =34.7 bits (78), Expect =3.7 

Identities = 16/36 (44%), Positives = 22/36 (61%), Gaps = 1/36 (2%) 

Query: 371 CDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYX-SD 406 

C G LY +KHA P FD+ A+ + Q+P FY+ D 
Sbjct: 643 CGGSKLYDSKHAMGFP-FDRPAYPDAFQVPNFYFKD 677 



Database: All non-redundant GenBank CDS 
translations+PDB+SwissProt+PIR+PRF 

Posted date: Apr 11, 2003 2:30 AM 
Numbfer of letters in database: 454,141,287 
Number of sequences in database: 1,411,415 

Lambda K H 

0.325 0.139 0.456 



Gapped 

Lambda K H 

0.267 0.0410 0.140 
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Matrix: BIiOSUM62 

Gap Penalties: Existence: 11, Extension: 1 

Number of Hits to DB: 473,361,261 

Number of Sequences: 1411415 

Number of extensions: 20503315 

Number of successful extensions: 51018 

Number of sequences better than 10.0: 27 

Number of HSP's better than 10,0 without gapping: 26 

Number of HSP's successfully gapped in prelim test: 1 

Number of HSP's that attempted gapping in prelim test: 50937 

Number of HSP's gapped (non-prelim): 33 

length of query: 486 

length of database: 454,141,287 

effective HSP length: 127 

effective length of query: 359 

effective length of database: 274,891,582 

effective search space: 98686077938 

effective search space used: 98686077938 

T: 11 

A: 40 

XI: 15 ( 7.0 bits) 
X2: -38 (14.6 bits) 
X3: 64 (24.7 bits) 
SI: 40 (21.6. bits) 
S2: 75 (33.5 bits) 



CNSOOCID: <WO___2004092360AaLl_> 
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FiGURE125 

FIGURE 125A 



-2- 



^00 ^ao ^40 \m ' 180 ' 200 ' 220 ' 240 ' 260 ' 280 ' 300 'sL ' 340 ' 360 ' 3^ ' 4*^ 



FIGURE 125B 



i '20 U'o 'c'o 'e'o 'lOo'lL'llo'llo'l^'2io'2k'2io'2L'2io'3io'3^o'3io'3io'3^o"4io'4^o'4lo' 
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FIGURE 126 

5'3' Frame 1 

QVHQ]WCVL-LIFYLMTLSR--SHKICQ-FQKWSRLQLTMLKFHSCFGVIM3MLKPST^^ 

YKQVKRGNQVLRCLTCTRCKECFLKSWFRIMVKMLLYQKE---MSQSIIiNCV]S^ 

-LYPPT-ELFTLVL 

5 '3* Frame 2 

RFIKMCWCD-SFT--LCRDNKVTRFVSDFKSGQGYN-LC-NFIHALV-GWTC-NLLPK^ 
TSKSSVATRCCDA-LVQDi^AS-KV-PSELW-KCCYTKRMfDECRK\^STVSILKYTYF 
SCTLQHESYSLWCW 

5'3' Frame 3 

GSSKCVCSVIDLLLDDFVEIIKSQDLSVISKVVKVTIDYAEISFMLWCKDGHVET^^ 
QASQAWQPGV^MPNLYKMQRMLLEKCDLQNYGENAVIPKGIMMNVAKYTQLCQYIOT 
AVPSNMRVIHFGAG 

3*5' Frame 1 

PAPK-ITLMLEGTAKVSVFKY-HS-VYFATFIIIPFGITAFSP-F-RSHFSRSILCILYK 
LGIATPGCHA-LACSFG-KVSTCPSLHQSMNEISA-SIVTLTTFEITDKSCDFIISTKSS 
SKRSITEHTHFDEP 

3*5' Frame 2 

QHQSE-LSCWRVQLK-VYLSIDTVEYTLRHSSLFLLV-QHFHHNSEGHTFQEAFFASCTS 
-ASQHLVATLDLLWLGRRFQHVHPYTKA-MKFQHSQL-P-PLLKSLTNLVTLLSRQSHQ 
VKDQSQNTHIIJyiNL 

3^5' Frame 3 

STKVMSHVGGYS-SKCI-VLTQLSILCDIHHYSFWYNSIFTIILKVTLFKKHSLHLVQV 

RHRM'WLPRLTCL-FWVEGFNMSILTPKHE-NFSIWCl^DHF-NH-QIL-LY^ 

-KINHRTHTF-T 
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FIGURE 127 

5'3' Frame 1 

-WTYPGKMQPRSLVDLFSKRTN-]W--WTPIKPT-CPPHYIWWTHRFN-Q-PEWR 

GQGQNSADPKVYPIILRLGSQLSLSMARRNLDSLEARAFQSTPIWQMTKIATTEELPDE 

FWVTAK-KSSAPD6TSIT-ELAQKLHFPTALTKKASYGLQLREP-IHPKTTLAPAILIT 

l^PPCYNFLKEQHCQKASTQREAEAAWPLLAPHHWAVIQEIQLLAAVGEILLLEWLAE 

WKLPSRYCC-TD-TSLRAKFLVKAMmAKLSLRNLLLRHLKSI^^ 

LGDWQNKPKEISGTKT-SDKELITNIGPQIAQFA 



5'3' Frame 2 



RFLPTQEKPTNLDLL-ICSLNEQIKMSDNGPQSNQRSAPRITFGGPTDSTDNNQNGGLQW 
GKAKTAPTPRFTQ-YCVLVHSSHSAWQGGT-IPSRPGRSNQHQ-WSR-PNWLLPKSYPTS 
SWW-RQNERAQPQMVLLLPR1WPRSFTSLRR~QRRHRMGCN-GSLEYTQRPHWHPQS--Q 
CCHRATTSSRNNIAKRLLRRGKQRRQSSLFSLLIT-SR-FKKFNSWQQ-GKFSCSNG-RR 
W-NCPRAIAARQIEPA-EQSFW-RPTTTRPNCH-EICC-GI-KASPKTYCHKTVORHSSI 
WETWSRTNPRKFRGPRPNQTRN-LQTLGRKLHNLP 



5'3' Frame 3 



GFYLPRKSQPTSISCRSVL-TimKCLIMDPNQTJWVPPALHLVDPQIQLTITRMEDCNG 
ARPKQRRPQGLPNNIASWFTALTQHGKEELRFPRGQGVPINTNS6PDDQIGYYRRATRRV 
RGGDGKMKELSPRWYFYYLGTGPEASLPYGANKEGIVWATEGAiOTPKDHIGTRNPI^ 
AATVLQLPQGTTLPKGFYAEGSRGGSQASSRSSSRSRGNSRNSTPGSSR6NSPARMASGG 
GETAMLLLLDRLNQLESKVSGKGQQQQGQTVTKKSAAEASKKPRQKRTATKQYNVTQAF 
GRRGPEQTQGNFGDQDLIRQGTDYKHWAANCTIC 

3 '5' Frame 1 

RQIVQFAAQCL-SVPCLIRSWSPKFPWVCSGPRLPNA-VTLYCFVAVRFWRGFLDASAAD 
FLVTVWPCCCWPLPETLLSSWFNLSSSNSARAVSPPPLAIRAGEFPLLLPGVEFLELPRL 
RDEEREEA-LPPLLPSA-KPFGNWP-GSCSTVAALLLGLRVPMWSLGVFKAPSVATHTM 
PSLLAP-GSEASGPVPR— KYHLGLSSFILPSPPRTRRVALR— PIWSSGPLLVLIGTPW 
PRGNLSSSLPC-VRAVNQDAILLGKPWGRRCFGLAPLQSSILVIVS-ICGSTKCNAGGTT 
LV-LGSIIRHFNLFV-RTDLQEIEVGWLFLGR-KP 

3'5' Frame 2 

GKLCNLRPWCNQFLV-LGLGPRNFLGFVLDHVSQMLE-RCTVLWQYVFGEAF-iytt^^ 
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S--QFGLVWGLYQKLCSQAGSICLAAIARGQFHHLR-PFEQENFPYCCQELNFIjNY^ 

VmSEKRLDCRLCFPLRRSLLAMLFLEEWARWQHCY-IX^GCQCGLWV^ 

L]X:-RRREVKLLGQFLGNRSTIW6-ALSFCRHHHELVG-LFGSSQFGHLDHYWC-LERPG 

LEGI-VPPCHAE-EL-TKTQYYWVNLGVGAVLALPHCSPPFWLLSVESVGPPN^ 

WFDWGPLSDILICSFREQIYKRSRLVGFSWVGKNL 

3*5' Frame 3 

ANCAICGPMFVISSLSD-VLVPEISLGLFWTTSPKCLSDWLFCGSTFLARLFRCLSSRF 
LSDSLJy:jrjLIJ^TRNFALKLVQSV-QQ-REGSFTTSASHSSRRISPTAARS-IS-ITATT 
--GAREGLTAASASLCVEAFWQCCSLRKL-HGGSIVIRIAGANVVFGCIQGSLSCNPYDA 
FFVSAVGK-SFWASS-VIEVPSGAELFHFAVTTTNSSGSSSWANLVIWTTIGVDT)^^ 
SRESKFLLMILSESCEPRRNIIG-TLGSALFWPCPIAVLHSGYCQLNLWVHQM-CGGHYV 
iGLIGVHYQTF-FVRLENRSTRDRGWLAFPG-VKT 



FIGURE 128 

-GLELKL-LTSICAF-PFCYSLF— CLLYFGFHSKSRI-KNLVPKSKRT-NFSLF-LVFL 

YAVAYAL-YSAVHLINLMCLKILVRYNTRGNTYSTAWLCALGKVLPFHRWHTW^ 

VTINCQDPAGGALI ARCWYLHEGHQTAAFRDVLWLNKRTN-NV- -WTPIKPT-CPPHYI 

WWTHRFN-Q-PEWRTQWGKAKTAPTPRFTQ-YCVLVHSSHSAWQGGT-IPSRPGRSNQHQ 

-WSR-P]MLLPKSYPTSSWW-RQNERAQPQMVLLLPR]MPRSFTSLRR-QRRHRMGCN-G 

SLEYTQRPHWHPQS— QCCHRATTSSRNNIAKRLLRRGKQRRQSSLFSLLIT-SR-FKKF 

NSWQQ-GKFSCSNG-RRW-NCPRAIAARQIEPA-EQSFW-RPTTTRPNCH-EICC-GI-K 

ASPKTYCHKTVQRHSSIWETWSRTNPRKFRGPRPNQTRN-LQTLAANCTICSKCLCIL\ft/N 

VTHWHGSHTFGNMADLSWSH-IG-QRSTIQRQRHTAEQAH-RIQNIPTNRA-KGQKEKD- 

-SSAFAAETKEAAHCDSSSC 

EDSSSSFN-LLFVLFSLSAIPCFNNAYYILVFTRNPGSRRTLYQSLNEHETSHCFDLYFS 

MQLHMHCSTALCI--TSCA-RSL-GTTLGVILIALLGFVL-ERFYLFIDGTLWFKHAHLM 

LLSTVKIQLVVRL-LGVGTPMKVTKLLHLETYLLF-INEQIKMSDNGPQSNQRSAPRITF 

GGPTDSTDNNQNGGRNGARPKQRRPQGLPNNIASWFTALTQHGKEELRFPRGQGVPINTN 

SGPDDQIGYYRRATRRVRGGDGKMKELSPRWYFYYLGTGPEASLPYGANKEGIVWVATEG 

ALNTPKDHIGTRNPNNNAATVLQLPQGTTLPKGFYAEGSRGGSQASSRSSSRSRGNSRNS 

TPGSSRGNSPARMASGGGETALALLLLDRLNQLESKVSGKGQQQQGQTVTKKSAAEASKK 

PRQKRTATKQYNVTQAFGRRGPEQTQGNFGDQDLIRQGTDYKHWPQIAQFAPSASAFFGM 

SRIGMEVTPSGTWLTYHGAIKLDDKDPQFKDWILIJmiDAYKTFPPTEPKKDiaa^ 

EAQPLPQRQKKQPTVTLLP 
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RTRAQALIDFYLCFLAFLLFLVLIMLIIFWFSLEIQDLEEPCTKV-TIMLLIVLTCIS 
CSCICTWQRCASNKPHVLEDPCKVQH-G-YL-HCIJ^SRKGFTFS-MAHYGSim^ 
YYQLSRSSWWCAYS-VLVPS-RSPNCCI-RRTCCFK-TNKLKCLIMDPNQTNWPPALHL 
VDPQIQLTITRMEDAMGQGQNSADPKVYPIILRLGSQLSLSMARRI^ 

WQMTKIATTEELPDEFVVWAK-KSSAPDGTSIT-EMQKLHFPTALTKKASYGLQLR^ 

P-IHPKTTLAPAILITMLPPCYNFLKEQHCQKASTQREAEAAVKPLLAPHHWAVIQEIO 

LLAAVGEILLLEWIAEVVKLPSRYCC-TD-TSLRAKFLVKA]^^ 

LAKNVLPQNSTTSLKHIX3DWQNKPKEISGTKT-SDKELITNIGRKL^^ 

HAIAWKSHLREHG-LIMEPLNWl^KIHNSKTTSYC-TSTLTOT 

KLSLCRRDKRSSPL-LFFL . ^^^^vi^i 



FIGURE 129 

5*3' Frame 1 

taccgtagactcatctctatgatgggtttcaaaatgaattaccaagtcaatggttaccct 

YRRL I SMMGFKMNYQVNGYP 
^^T^^^,^^^^^^^^^''''^^^^^^^^^'^^^^^^^^^^^^ttcgtgcgtggattggctttgat 

N M F I T R E E A I R . H V R A W I G F D 
gtagagggctgtcatgcaactagagatgctgtgggtactaacctacctctccagctagga 

V E G C H A T R D A V G T N L P L Q L G 
ttttctacaggtgttaacttagtagctgtaccgactggttatgttgacactgaaaataac 

F S T G V N L V A V P T G Y V D T E N N 

acagaattcaccagagttaatgcaaaacctccaccaggtgaccagtttaaacatcttatacc 
TEFTRV NAKPPP GDQFKHLI 



5'3' Frame 2 

taccgtagactcatctctatgatgggtttcaaaatgaattaccaagtcaatggttacccta 

TVDSSL-WVSK-IT KSMVTL 
atatgtttatcacccgcgaagaagctattcgtcacgttcgtgcgtggattggctttgatg 

I C L S P A K K L F V T F V R G L A L M 
tagagggctgtcatgeaactagagatgctgtgggtactaacctacctctccagctaggat 

-RAVMQLEMLWVLTYLSS-D 
tttctacaggtgttaacttagtagctgtaccgactggttatgttgacactgaaaataaca 

F LQVLT--LYR LVMLTLKIT 
^^^^S*^^o^^^^^^^*^*'^^^^^^^^^^^'^^^^^^^^5*^eraccagtttaaacatcttatacc 

QNSP ELM QNLHQVTSLNI LY 
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5'3' Frame 3 

taccgtagactcatctctatgatgggtttcaaaatgaattaccaagtcaatggttaccctaa 

P-TH LYDGFQNEL P SQWLP- 
tatgtttatcacccgcgaagaagctattcgtcacgttcgtgcgtggattggctttgatgt 

YVYHPRRSYSSRSCVD WL- C 
agagggctgtcatgcaactagagatgctgtgggtactaacctacctctccagctag'gktt 

RGLSCN-RCCGY-PTSPARI 
ttctacaggtgttaacttagtagctgtaccgactggttatgttgacactgaaaataacac 

FYRC-LSSCTDWLC-H-K -H 
agaattcaccagagttaatgcaaaacctccaccaggtgaccagtttaaacatcttatacc 

RIHQS - CKTS TR-PV -TSYT 



3'5' Frame 1 

ggtataagatgtttaaactggtcacctggtggaggttttgcattaactctggtgaattct 

G,I R C L NW S P G GG FAL T LVN S 
gtgttattttcagtgtcaacataaccagtcggtacagctactaagttaacacctgtagaa 

VL FSV ST-PVGTATKLTPV. E 
aatcctagctggagaggtaggttagtacccacagcatctctagttgcatgacagccctct 

N PS,.WRGRLVPTASLyA-Q P S 
acatcaaagccaatccacgcacgaacgtgacgaatagcttcttcgcgggtgataaacata 

TSKPIHART-RIASSRV INI 
ttagggtaaccattgacttggtaattcattttgaaacccatcatagagatgagtctacggta 

LG-PLTW-FILK PIIEMSLR 



3*5* Frame 2 

ggtafcaagatgtttaaactggtcacctggtggaggttttgcattaactctggtgaattctg 

V-DV-TGHLVEVLH- LW- IL 
tgttattttcagtgtcaacataaccagtcggtacagctactaagttaacacctgtagaaa 

CY FQC QHNQ SVQLLS - H L - K 
atcctagctggagaggtaggttagtacccacagcatctctagttgcatgacagccctcta 

ILAGEVG-YPQHL-LHDSPL 
catcaaagccaatccacgcacgaacgtgacgaatagcttcttcgcgggtgataaacatat 

HQS QSTHER DE-LLRG--TY 
tagggtaaccattgacttggtaattcattttgaaacccatcatagagatgagtctacggta 

-GNH-L6NSF-NPS-R-VYG 
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3'5' Frame 3 

ggtataagatgtttaaactggtcacctggtggaggttttgcattaactctggtgaattctgt 

YKMFKLVTWWRFCINSGEFC 
gttattttcagtgtcaacataaccagtcggtacagctactaagttaacacctgtagaaaa 

V I F S V N I T S R Y S Y .- V N T C R- K 
tcctagctggagaggtaggttagtacccacagcatctctagttgcatgacagccctctac 

S-LER-VSTHSISSCMTALY 
atcaaagccaatccacgcacgaacgtgacgaatagcttcttcgcgggtgataaacatatt 

IKANPRTNVTNSFFAGDKH I 
agggtaaccattgacttggtaattcattttgaaacccatcatagagatgagtctacggta 

RVTIDLVI HFETH HRDESTV 

FIGURE ISO 

10 20 30 40 50 60 

SEQ ID NO: 10034 YRRLISMGFKMNYQWGYPNMFITREEAIRHVIU^WIGFDVEGCHATRDAVGTN^^ 

Prim. Cons. " kghd2rrlismgfkmnyqvngypnmfitreeairh™wigfdvegchatr^^ 

70 80 90 100 110 120 

Mn'inn?I J'QJ^^f GVM.VAVPTGYVDTEI^KFIT^LqTS^ 
SEQ ID NO: 10034 LQLGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLI 

Prim. cons. LQLGFSTGmVAVPTGYVDTENNT2FTRVNA222222QFKHLIPI^ 

140 . 150 160 170 180 

lln S Mn'innol ^Q^^^^^^^SDRVVmWAHGFELTSMLvKIGPmiTCCLCDK^^^ ' 
ony ID NO: 10034 ; 

Prim. cons. VQMLSDTLKGLSDRVVFVLWAHGFELTSMKYFmGPERTCCLCDmTCFSTSS^^ 
190 200 

SEQ ID NO: 9997 WNHSVGFDYVYNPFMIDVQQWGLYG 
SEQ ID NO; 10034 
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FIGURE 13i 

5 '3' Frame 1 

caggttcatcaaaatgtgtgtgttctgtgattgatcttttacttgatgactttgtcgaga 

QVHQNV CVL-LIFYLMTLSR 
taataaagtcacaagatttgtcagtgatttcaaaagtggtcaaggttacaattgactatg 

- - SHKI CQ - F QKWS RLQLTM- 
ctgaaatttcattcatgctttggtgtaaggatggacatgttgaaaccttctacccaaaac 

LKFHSCFGVRMDMLKPSTQN 
tacaagcaagtcaagcgtggcaaccaggtgttgcgatgcctaacttgtacaagatgcaaa 

YKQVKRGNQVLRCLTCTRCK 
gaatgcttcttgaaaagtgtgaccttcagaattatggtgaaaatgctgttataccaaaag 

EC FLKSVTFRIM VKMLLY QK 
gaataatgatgaatgtcgcaaagtatactcaactgtgtcaatacttaaatacacttactt 

E---MSQSILNCVNT-IHLL 
tagctgtaccctccaacatgagagttattcactttggtgctgg 

- LYPPT-ELFTLVL 



5'3' Frame 2 

caggttcatcaaaatgtgtgtgttctgtgattgatcttttacttgatgactttgtcgagat 

R FIKMCVFCD - S FT - - LCRD 
aataaagtcacaagatttgtcagtgatttcaaaagtggtcaaggttacaattgactatgc 

N K V T R F V S D F K S G Q G Y N - L C 
tgaaatttcattpatgctttggtgtaaggatggacatgttgaaaccttctacccaaaact 

-NFI HALV-GWT C-N LLPKT 
acaagcaagtcaagcgtggcaaccaggtgttgcgatgcctaacttgtacaagatgcaaag 

TSKSSVATRCCDA-LVQDAK 
aatgcttcttgaaaagtgtgaccttcagaattatggtgaaaatgctgttataccaaaagg 

NAS - KV - PSELW - KCCY T KR 
aataatgatgaatgtcgcaaagtatactcaactgtgtcaatacttaaatacacttacttt 

NNDECRKV YSTVS ILKYTY F 
agctgtaccctccaacatgagagttattcactttggtgctgg 

SCT LQH^SYSLWCW 



5'3' Frame 3 

caggttcatcaaaatgtgtgtgttctgtgattgatcttttacttgatgactttgtcgagata 

GSSKCVCSVIDLLLDDFVE I 
ataaagtcacaagatttgtcagtgatttcaaaagtggtcaaggttacaattgactatgct 

IKSQDLSVISKVVKVTIDYA 
gaaatttcattcatgctttggtgtaaggatggacatgttgaaaccttctacccaaaacta 

EIS FMLWCKDGHVETFYPKL 
caagcaagtcaagcgtggcaaccaggtgttgcgatgcctaacttgtacaagatgcaaaga 
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QASQ AWQPGVA MPNLYKM QR 
atgcttcttgaaaagtgtgaccttcagaattatggtgaaaatgctgttataccaaaagga 

MLLEK CDLQNYGENAVIPKG 
ataatgatgaatgtcgcaaagtatactcaactgtgtcaatacttaaatacacttacttta 

IMMNVAKYTQLCQYLNTLTL 
gctgtaccctccaacatgagagttattcactttggtgctgg 

AVPSNMR VIHFGA 



3'5* Frame 1 

ccagcaccaaagtgaataactctcatgttggagggtacagctaaagtaagtgtatttaag 

P A PK-ITLMLEGTAKVSVFK 
tattgacacagttgagtatactttgcgacattcatcattattccttttggtataacagca 

Y-HS-VYFATFIIIPFGITA 
ttttcaccataattctgaaggtcacacttttcaagaagcattctttgcatcttgtacaag 

FSP-F-RSHFS RSILCILYK 
ttaggcatcgcaacacctggttgccacgcttgacttgcttgtagttttgggtagaaggtt 

LGIATPGCHA - LACSFG- KV 
tcaacatgtccatccttacaccaaagcatgaatgaaatttcagcatagtcaattgtaacc 

STCPSL H QSMN EISA-S IVT 
ttgaccacttttgaaatcactgacaaatcttgtgactttattatctcgacaaagtcatca 

LTTFEITDKSCDFIIS TKSS 
agtaaaagatcaatcacagaacacacacattttgatgaacctg 

SKRSITEHTHFDEP 



3 '5' Frame 2 

ccagcaccaaagtgaataactctcatgttggagggtacagctaaagtaagtgtatttaagt 

Q HQSE-LSCWRVQLK-VYLS 
attgacacagttgagtatactttgcgacattcatcattattccttttggtataacagcat 

IDTVEYTLRHSSLFLL V- QH 
tttcaccataattctgaaggtcacacttttcaagaagcattctttgcatcttgtacaagt 

FHHNSE6HTFQEA FFASCTS 
taggcatcgcaacacctggttgccacgcttgacttgcttgtagttttgggtagaaggttt 

-AS QHLVATLD LLVVLGRRF 
caacatgtccatccttacaccaaagcatgaatgaaatttcagcatagtcaattgtaacct 

QHVHPYTKA-MKFQHSQL-P 
tgaccacttttgaaatcactgacaaatcttgtgactttattatctcgacaaagtcatcaa 

-PLLKSLTNLVTLLSRQSHQ 
gtaaaagatcaatcacagaacacacacattttgatgaacctg 

VKDQSQNTHILMNL 



3'5' Frame 3 
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ccagcaccaaagtgaataactctcatgttggagggtacagctaaagtaagtgtatttaagta 

STKVNNSHVGGYS-SKCI-V 
ttgacacagttgagtatactttgcgacattcatcattattccttttggtataacagcatt 

LTQLSILCDIH HYSFWYNSI 
ttcaccataattctgaaggtcacacttttcaagaagcattctttgcatcttgtacaagtt 

FT I ILKVTLFKKHSLHLVQV 
aggcatcgcaacacctggttgccacgcttgacttgcttgtagttttgggtagaaggtttc 

RHRNTW LPRLTCL-FW VEG F 
aacatgtccatccttacaccaaagcatgaatgaaatttcagcatagtcaattgtaacctt 

NMSILTPKHE-NFSIVNC NL 
gaccacttttgaaatcactgacaaatcttgtgactttattatctcgacaaagtcatcaag 

DH F - NH - Q I L - L Y YLD KV I K 
taaaagatcaatcacagaacacacacattttgatgaacctg 

-KI NHRT HTF--T 



FIGURE 132 

5'3' Frame 1 

taggtttttacctacccaggaaaagccaaccaacctcgatctcttgtagatctgttctct 

-V FTY PGKA NQ P RSLVDL F S 
aaacgaacaaattaaaatgtctgataatggaccccaatcaaaccaacgtagtgccccccg 

KR TN-NV--WTPIKPT -CP. P 
cattacatttggtggacccacagattcaactgacaataaccagaatggaggactgcaatg 

HY IWWTHRFN - Q-PEWRTAM 
gggcaaggccaaaacagcgccgaccccaaggtttacccaataatatfcgcgtcttggttca 

GQGQNSA DPKVYPIILRLGS 
cagctctcactcagcatggcaaggaggaacttagattccctcgaggccagggcgttccaa 

QL SLS MARRNLDSLEARAFQ 
tcaacaccaatagtggtccagatgaccaaattggctactaccgaagagctacccgacgag 

STPIVVQMTKLATTEELPDE 
ttcgtggtggtgacggcaaaatgaaagagctcagcGccagatggtacttctattacctag 

F VVVTAK-KSSAPDGTSIT- 
gaactggcccagaagcttcacttccctacggcgctaacaaagaaggcatcgtatgggttg 

ELAQKLHFPTALTKKASYGL 
caactgagggagccttgaatacacccaaagaccacattggcacccgcaatcctaataaca 

QLREP-IHPKTTLAPAILIT 
atgctgccaccgtgctacaacttcctcaaggaacaacattgccaaaaggcttctacgcag 

MLP PCYNFLK E Q HCQKASTQ 
agggaagcagaggcggcagtcaagcctcttctcgctcctcatcaegtagtcgcggtaatt 

REAEAAVKPLLAPHHVVAVI 
caagaaattcaactcctggcagcagtaggggaaattctcctgctcgaatggctagcggag 

QE IQLLAAVGEILLLEWLAE 
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gtggtgaaactgccctcgcgctattgctgctagacagattgaaccagcttgagagcaaag 

VV KLPSRYCC-TD-TS LRAK 
tttctggtaaaggccaacaacaacaaggccaaactgtcactaagaaatctgctgctgagg 

FLVKANNNKAKL SLRNL LLR 
catctaaaaagcctcgccaaaaacgtactgccacaaaacagtacaacgtcactcaagcat 

HLKS LAKNVLPQNSTTSLKH 
ttgggagacgtggtccagaacaaacccaaggaaatttcggggaccaagacctaatcagac 

LGDVVQNKPKEISGT KT - SD 
aaggaactgattacaaacattgggccgcaaattgcacaatttgcct 

KELITN IGPQIAQ FA 



5*3 • Frame 2 

taggtttttacctacccaggaaaagccaaccaacctcgatctcttgtagatctgttctcta 

R F L P T Q EKPTNLDLL-ICSL 
aacgaacaaattaaaatgtctgataatggaccccaatcaaaccaacgtagtgccccccgc 

NEQIKMS DNGPQSNQRSAPR 
attacatttggtggacccacagattcaactgacaataaccagaatggaggactgcaatgg 

ITF GGPTDSTDNNQNGGLQW 
ggcaaggccaaaacagcgccgaccccaaggtttacccaataatattgcgtcttggttcac 

GKAKTAPTPRPTQ-YCVLVH 
agctctcactcagcatggcaaggaggaacttagattccctcgaggccagggcgttccaat 

SSHSAWQGGT - IPSRPGR S.N 
caacaccaatagtggtccagatgaccaaattggctactaccgaagagctacccgacgagt 

Q H.Q - W S R - P N W L L P K S Y P T S 

tcgtggtggtgacggcaaaatgaaagagctcagccccagatggtacttctattacctagg 

SWW-RQNERAQPQMVLL LPR 
aactggcccagaagcttcacttccctacggcgctaacaaagaaggcatcgtatgggttgc 

NWPRSFTSL RR-QRRHRMGC 
aactgagggagccttgaatacacccaaagaccacattggcacccgcaatcctaataacaa 

N-GSLEYTQRPHWHPQS- -Q 
tgctgccaccgtgctacaacttcctcaaggaacaacattgccaaaaggcttctacgcaga 

CCHRATTSSRNNIA KRLLRR 
gggaagcagaggcggcagtcaagcctcttctcgctcctcatcacgtagtcgcggtaattc 

GKQRRQSSL F S LLIT-SR-F 
aagaaattcaactcctggcagcagtaggggaaattctcctgctcgaatggctagcggagg 

KK FNSWQQ- GKFSCSNG-RR 
tggtgaaactgccctcgcgctattgctgctagacagattgaaccagcttgagagcaaagt 

W-NCPRAIAARQIEPA-EQS 
ttctggtaaaggccaacaacaacaaggccaaactgtcactaagaaatctgctgctgaggc 

FW-RPTT TRP NCH-EICC - G 
atctaaaaagcctcgccaaaaacgtactgccacaaaacagtacaacgtcactcaagcatt 

I-K ASPKTYCHKTVQRHSSI 
tgggagacgtggtccagaacaaacccaaggaaatttcggggaccaagacctaatcagaca 

WETWSRT NPRKFRGP RPN QT 
aggaactgattacaaacattgggccgcaaattgcacaatttgcct 

RN-LQTLGRKLHN LP 
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5*3' Frame 3 

taggtttttacctacccaggaaaagccaaccaacctcgatctcttgtagatctgttctctaa 

GFYLPRKS QPTSISCRSVL- 
acgaacaaattaaaatgtctgataatggaccccaatcaaaccaacgtagtgccccccgca 

TNKL KC L I MD P NQ TNVV P P A. 
ttacatttggtggacccacagattcaactgacaataaccagaatggaggactgcaatggg 

LHLVDPQIQLTITRMEDCN G 
gcaaggccaaaacagcgccgaccccaaggtttacccaataatattgcgtcttggttcaca 

ARPKQRRPQGLPNNIASWFT 
gctctcactcagcatggcaaggaggaacttagattccctcgaggccagggcgttccaatc 

ALTQH6KEELRFPRGQGVPI 
aacaccaatagtggtccagatgaccaaattggctactaccgaagagctacccgacgagtt 

NTNSG PDDQ I GYYRRATRRV 
cgtggtggtgacggcaaaatgaaagagctcagccccagatggtacttctattacctagga 

RGGDG KMKELSPRWYFYYLG 
actggcccagaagcttcacttccctacggcgctaacaaagaaggcatcgtatgggttgca 

TG PEAS LPYGANKEGIVWVA 
actgagggagccttgaatacacccaaagaccacattggcacccgcaatcctaataacaat 

TEGALNTPKDHIGT RN PN N N 
gctgccaccgtgctacaacttcctcaaggaacaacattgccaaaaggcttctacgcagag 

AATVLQLPQGTT.LP:KG FYAE 
ggaagcagaggcggcagtcaagcctcttctcgctcctcatcacgtagtcgcggtaattca 

G SR GGSQAS S RS S S RS R G NS 
agaaattcaactcctggcagcagtaggggaaattctcctgctcgaatggctagcggaggt 

RNSTPGSSRGNSPARMASG.G 
ggtgaaactgccctcgcgctattgctgctagacagattgaaccagcttgagagcaaagtt 

GET A LALLLLDRLNQLESKV 
tctggtaaaggccaacaacaacaaggccaaactgtcactaagaaatctgctgctgaggca 

SGKGQQQQG QTVT KKSAAEA 
tctaaaaagcctcgccaaaaacgtactgccacaaaacagtacaacgtcactcaagcattt 

SKK PRQKRT ATKQYNVTQAF 
gggagacgtggtccagaacaaacccaaggaaatttcggggaccaagacctaatcagacaa 

GRRGPEQTQGNFGDQDLIRQ 
ggaactgattacaaacattgggccgcaaattgcacaatttgcCt 

GTDYKHW AANCTI C 



3 '5' Frame 1 

aggcaaattgtgcaatttgcggcccaatgtttgtaatcagttccttgtctgattaggtct 
RQIVQFA AQCL-SVPGL IRS 

tggtccccgaaatttccttgggtttgttctggaccacgtctcccaaatgcttgagtgacg 
WSPKFPWVCSGPRLPNA-VT 

ttgtactgttttgtggcagtacgtttttggcgaggctttttagatgcctcagcagcagat 
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L YCFVA V RFWRGFL DASA AD 
ttcttagtgacagtttggccttgttgttgttggcctttaccagaaactttgctctcaagc 
F L V T V W P C C C W P L P E T L L S S 
tggttcaatctgtctagcagcaatagcgcgagggcagtttcaccacctccgctagccatt 
WFNLSSSNSARAVSPPPL AI 
cgagcaggagaatttcccctactgctgccaggagttgaatttcttgaattaccgcgacta 

R A G E F P L L L P G V E F L E L P R L 
cgtgatgaggagcgagaagaggcttgactgccgcctctgcttccctctgcgtagaagcGt 
^. .^A-I-P PLLP S A-K P 



V T^f ''^^'^^^^^^^^^^^^^^^^^^tggcagcattgttattaggattg 
FGNVVP-GSCSTVAALLLGL 
cgggtgccaatgtggtctttgggtgtattcaaggctccctcagttgcaacccatacgatg 

RVPMWSL6VFKAPSVATHT M 
ccttctttgttagcgccgtagggaagtgaagcttctgggccagttcctaggtaatagaag 

PSLLAP-GSEASGPVPR— -K 
taccatctggggctgagctctttcattttgccgtcaccaccacgaactcgtcgggtagct 

YHLGLSSFILPSPPRTRRV A 
cttcggtagtagccaatttggtcatctggaccactattggtgttgattggaacgccctgg 

L R--PIWSS.GPLLVLIGTPW 
cctcgagggaatctaagttcctccttgccatgctgagtgagagctgtgaaccaagacgca 

^^^^SSSLPC -VRAVNQDA 
atattattgggtaaaccttggggtcggcgctgttttggccttgccccattgcagtcctcc 
• J-L LGKPWGR RCFGLAPLQSS 

attctggttattgtcagttgaatctgtgggtccaccaaatgtaatgcggggg^ 
■ J ul ^^-^CGST KCNAGGTT 

ttggtttgattggggtccattatcagacattttaatttgttcgtttagagaacagatcta 

L V-LG SIIRHFNLFV-RTDL 
caagagatcgaggttggttggcttttcctgggtaggtaaaaaccta 
Q=EIEVGWLFLGR-K.P 

3*5» Frame 2 

aggcaaattgtgcaatttgcggcccaatgtttgtaatcagttccttgtctgattaggtctt 

GKLCNLRPNVCNQFLV-LGL 
ggtccccgaaatttccttgggtttgttctggaccacgtctcccaaatgcttgagtgacgt 

G PRNFLGFVLDHVSQMLE-R 
^f^f^f^^^^f^^^^^^^^^tttggcgaggctttttagatgcctcagcagcagatt 
4-^.7 ^^^^QYVFGEA F-MPQQQI 
tcttagtgacagtttggccttgttgttgttggcctttaccagaaactttgctctcaagct 

S - - QFGLVVVGLY QKLCSQ A 
ggttcaatctgtctagcagcaatagcgcgagggcagtttcaccacctccgctagccattc 

GSICLAAIARGQFHHLR-PF 
gagcaggagaatttcccctactgctgccaggagttgaatttcttgaattaccgcgactac 

EQENFPYCCQE LNFLNYRDY 
gtgatgaggagcgagaagaggcttgactgccgcctctgcttccctctgcgtagaagcctt 

VMR SEK RLD CRLCF PLRRSL 
ttggcaatgttgttccttgaggaagttgtagcacggtggcagcattgttattaggattgc 
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LAML FLEEVVARWQHCY-DC 
gggtgccaatgtggtctttgggtgtattcaaggctccctcagttgcaacccatacgatgc 

GCQCGLWVYS RLPQLQPIR C 
cttctttgttagcgccgtagggaagtgaagcttctgggccagttcctaggtaatagaagt 

L LC - RRREVKL LGQFLGNRS 
accatctggggctgagctctttcattttgccgtcaccaccacgaactcgtcgggtagctc 

TIWG~ALSFCRHHHELVG-L 
ttcggtagtagccaatttggtcatctggaccactattggtgttgattggaacgccctggc 

FGS SQF GHLDHYWC - LERPG 
ctcgagggaatctaagttcctccttgccatgctgagtgagagctgtgaaccaagacgcaa 

LEGI-VPPCHAE-EL-TKTQ 
tattattgggtaaaccttggggtcggcgctgttttggccttgccccattgcagtcctcca 

YYWVNLGVGAVLALPHCSPP 
ttctggttattgtcagttgaatctgtgggtccaccaaatgtaatgcggggggcactacgt 

FWLLSVESVGPPNVMRGALR 
tggtttgattggggtccattatcagacattttaatttgttcgtttagagaacagatctac 

WFDWGP LS DILICSFREQIY 
aagagatGgaggttggttggcttttcctgggtaggtaaaaaccta 

K. R SRLVGFSWVGKNL 



3'5' Frame 3 

aggcaaattgtgcaatttgcggcccaatgtttgtaatcagttccttgtctgattaggtcttg 

A'NCAI C GPMFV I S S L SD - VL 
gtccccgaaatttccttgggtttgttctggaccacgtctcccaaatgcttgagtgacgtt 

VPEISLGLFWTTSPKCL SDV 
gtactgttttgtggcagtacgtttttggcgaggctttttagatgcctcagcagcagattt 

VLFCGSTFLARL FRCLSSRF 
cttagtgacagtttggccttgttgttgttggcctttaccagaaactttgctctcaagctg 

LSDSLALLLLAFTRNFALKL 
gttcaatctgtctagcagcaatagcgcgagggcagtttcaccacctccgctagccattcg 

VQSV-QQ-REGSFTTSASHS 
agcaggagaatttcccctactgctgccaggagttgaatttcttgaattaccgcgactacg 

SRRIS PTAARS - IS - ITATT 
tgatgaggagcgagaagaggcttgactgccgcctctgcttccctctgcgtagaagccttt 

--G ARRGLTAASASLCVEAF 
tggcaatgttgttccttgaggaagttgtagcacggtggcagcattgttattaggattgcg 

W QCCSLRKL- HGGSIVIRIA 
ggtgccaatgtggtctttgggtgtattcaaggctccctcagttgcaacccatacgatgcc 

GANVVFGCIQG SLS CNPYDA 
ttctttgttagcgccgtagggaagtgaagcttctgggccagttcctaggtaatagaagta 

FFVSAVGK-SFWASS- VIEV 
ccatctggggctgagctctttcattttgccgtcaccaccacgaactcgtcgggtagctct 

PSGAEL FHF AVTTTNS SGSS 
tcggtagtagccaatttggtcatctggaccactattggtgttgattggaacgccctggcc 

SVVAN L V I WTT I 6 V DWNA LA 
tcgagggaatctaagttcctccttgccatgctgagtgagagctgtgaaccaagacgcaat 



BNSDOCIO: <W Q a 0O4O9g3e0A2 I > 
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SRESKFLLAMLSESCEPRR N 
attattgggtaaaccttggggtcggcgctgttttggccttgccccattgcagtcctccat 

II G-TLGSALFWPCPIAVLH 
tctggttattgtcagttgaatctgtgggtccaccaaatgtaatgcggggggcactacgtt 

SGYCQLNLWVHQM-CG6H YV 
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99-104(part1ally). 105. 106-107.(partiklly). los! 109. 114-120(pa|It1ally) 
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This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 



Invention 1 



claims: l-12(parti ally), 13(part1 ally), 14, 15, 
16-21(partiany). 22-28(partiany). 73-77(partial ly), 
79(part1any), 83(part1ally), 85-93(partially), 94-98, 
99-104(part1 ally), 105, 106-107(part1ally), 108, 109, 
114-120(partially) 

The polypeptide is a spike (S) polypeptide encoded by SEQ. 
ID n 6042, nucleic acid encoding the spike protein, 
fragments thereof, antibodies specific for the spike 
protein, immunoassays using these antibodies, a vaccine 
comprising a spike protein, a viral vector comprising the 
protein, an immunogenic fragment thereof, double stranded 
RNA thereof, the recombinalnt expression thereof and the 
medical use thereof. 



Invention 2 



cl aims : l-12(part 1 al ly) , 13(part 1 al ly) , 16-21(part1 al ly) , 
22-28(part1ally), 73-77(part1ally) , 79(part1ally). 
83(part1ally), 85-93( parti ally), 99-104(partiany), 
106-107(partiany), 114-120(part1any) 
the polypeptide Is an envelope (E) polypeptide encoded by 
SEQ. ID n 6045, nucleic acid encoding the E protein, 
fragments thereof, antibodies specific for the E protein. 
Immunoassays using that antibody, a vaccine comprising an E 
protein, a viral vector comprising the protein, double 
stranded RNA thereof the recombinant expression thereof and 
the medical use thereof. 



Invention 3 



cl alms : 1-I2(part1 ally ) . 13(part 1 al ly) . 16-21(part1 al ly) . 
22-28( parti ally), 73-77( parti ally), 79(part1ally), 
83(part1ally), 85-93(part1any), 99-104(part1any), 
106-107(part1 al ly) , 114-120(part1al ly) 
the polypeptide Is a membrane (M) polypeptide encoded by 
SEQ. ID n 6046, nucleic acid encoding the M protein, 
fragment thereof, antibodies specific for the M protein, 
immunoassays using that antibody, a vaccine comprising a M 
protein, a viral vector comprising the protein, double 
stranded RNA thereof, the recombinant expression thereof and 
the medical use thereof. 
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• l-12(parti al ly) , 13(parti ally) , 16-21(parti al 1y) , 
22-27(part1ally), 73-77(parti al ly) , 79(partial ly), 
83(partially), 85-93(parti al ly) , 99-104(part1al ly) 
106-107(partially), 114-120(part1al ly) 
the polypeptide is a hemagglutinin esterase (HE) 
polypeptide, nucleic acid encoding the HE protein, fragment 
thereof, antibodies specific for the HE protein. 
Immunoassays using that antibody, a vaccine comprising a HE 
protein, a viral vector comprising the protein, double 
stranded RNA thereof, the recombinant expression thereof and 
the medical use thereof. 



Invention 5 



ol*l!IIf-^"^^^P^^*^^lly>'13(part1ally),16-21(part1ally), 
22-28(partially), 73-77(part1al ly). 79(partially). 
83(part1ally), 85-93{partially). 99-104(part1any). 
106-107(partiany), 114-120(partial ly) 

IS« Polypeptide Is a nucleocapsid (N) polypeptide encoded by 
SEQ. ID n 6052, nucleic acid encoding the N protein, 
fragment thereof, antibodies specific for the N protein 
immunoassays using that antibody, a vaccine comprising a N 
protein, a viral vector comprising the protein, double 
stranded RNA thereof, the recombinant expression thereof and 
the medical use thereof. 



Invention 6 



oI^l^Df • ^"^^^P^""*^' ^1 • 13(part1al ly) , 16-21(part1 al ly) , 
22-27(part1ally), 73-77(part1al ly) , 79(part1ally), 
83(part1ally), 85-93(part1ally) , 99-l04(part1ally) 
106-107(part1ally), 114-120(partial ly) 
The ORFla polypeptide encoded by SEQ. ID n 6039, the 
preoteolytic fragments thereof such as NSPl-Nsp-7 
corresponding to SEQ. ID n 9766-9774), nucleic acid encoding 
these proteins, fragments thereof, antibodies specific for 
these proteins. Immunoassays using these antibodies, a 
vaccine comprising the protein, a viral vector comprising 
the protein, double stranded RNA thereof, the recombinant 
expression thereof and the medical use thereof. 
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c1 aims :l-12(parti ally), 13(parti ally), 16-21(part1 ally), 
22-27(partially), 73-77(part1ally), 79(part1ally), 
83(partially), 85-93(part1 ally) , 99-104(partially), 
106-107(partially), 114-120(partial ly) 
The ORFlab polypeptide encoded by SEQ. ID n 6041, the 
preoteolytic fragments thereof such as NSP9-Nsp-13 
corresponding to SEQ. ID n 9775-9779), nucleic acid encoding 
these proteins, fragments thereof, antibodies specific for 
these proteins, immunoassays using these antibodies, a 
vaccine comprising the protein, a viral vector comprising 
the protein, double stranded RNA thereof, the recombinant 
expression thereof and the medical use thereof. 



Invention 8 

claims: 22(partially). 29-58, 84, 
Il0-ll3(part1 al ly) , 114-118( parti ally) : 

A vaccine comprising an Inactivated/attenuated SARS virus, a 
method of inactivating the SARS virus, a method of making an 
inactivated SARS vaccine 



Invention 9: 



claims: 77(part1ally), 78, 79(partial ly), 80-82, 83 
(patially),110-113(partially), 119-120 (partially) 
A method of treatment of a patient suffering from SARS and a 
method of Identifying a therapeutically active agent 
comprising measuring the attenuation of a SARS related 
enzyme, a method of treatment using a therapeutical agent of 
claims 77-82. 



Invention 10-7760 



claims :59-72(partially) 
A single-stranded oligonucleotide selected from the group 
consisting of the SEQ. IDs 21-6020, 6076-6568, 6586-6587, 
7292-7301, 7325-7328, 7332-7352, 7353-7385, 10235-10298, 
10352-10504, 10580-11322. 11325-11551 (taken from the list 
of claim number 59), PCR kit comprising these primers, a 
method of detecting the presence of SARS virus in a sample 
using PCR. 
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